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1^. (57) Abstract: This invention relates to genes identified from human chromosome 20pl3-pl2, which are associated with various 
— . diseases, including asthma. The invention also relates to the nucleotide sequences of these genes, isolated nucleic acids comprising 
these nucleotide sequences, and isolated polypeptides or peptides encoded thereby. The invention further relates to vectors and host 
cells comprising the disclosed nucleotide sequences, or fragments thereof, as well as antibodies that bind to the encoded polypeptides 
or peptides. Also related are ligands that modulate the activity of the disclosed genes or gene products. In addition, the invention 
relates to methods and compositions employing the disclosed nucleic acids, polypeptides or peptides, antibodies, and/or ligands for 

1^" use in diagnostics and therapeutics for asthma and other diseases. 
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NOVEL HUMAN GENE RELATING TO RESPIRATORY DISEASES, 
OBESITY, AND INFLAMMATORY BOWEL DISEASE 



FIELD OF THE INVENTION 

10 This invention relates to genes identified from human chromosome 

20p13-p12, including Gene 216, which are associated with asthma, obesity, 
inflammatory bowel disease, and other human diseases. The invention also 
relates to the nucleotide sequences of these genes, including genomic DNA 
sequences, cDNA sequences, and single nucleotide polymorphisms. The 

1 5 invention further relates to isolated nucleic acids comprising these nucleotide 
sequences, and isolated polypeptides or peptides encoded thereby. Also 
related are expression vectors and host cells comprising the disclosed nucleic 
acids or fragments thereof, as well as antibodies that bind to the encoded 
polypeptides or peptides. The present invention further relates to ligands that 

20 modulate the activity of the disclosed genes or gene products. In addition, the 
invention relates to diagnostics and therapeutics for various diseases, including 
asthma, utilizing the disclosed nucleic acids, polypeptides or peptides, 
antibodies, and/or ligands. 
BACKGROUND 

25 Mouse chromosome 2 has been linked to a variety of disorders including 

airway hyperesponsiveness and obesity (DeSanctis et a(., 1995, Nature 
Genetics, 11:150-154; Nagle et al., 1999, Nature, 398:148-152). This region 
of the mouse genome is homologous to portions of human chromosome 20 
including 20p13-p12. Although human chromosome 20p13-12p has been 

30 linked to a variety of genetic disorders including diabetes insipidus, 
neurohypophyseal, congenital endothelial dystrophy of cornea, insomnia, 
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neurodegeneration with brain iron accumulation 1 (Hallervorden-Spatz 
syndrome), fibrodysplasia ossificans progressiva, alagille syndrome, 
hydrometrocolpos (McKusick-Kaufman syndrome), Creutzfeldt- Jakob disease 
and Gerstmann-Straussler disease (see NCBI; National Center for 
5 Biotechnology information, National Library of Medicine, 38A, 8N905, 8600 
Rockville Pike, Bethesda, MD 20894; wwwmcbi.nlm.nih.gov) the genes 
affecting these disorders have yet to be discovered. There is a need in the art 
for identifying specific genes relating to these disorders, as well as genes 
associated with obesity, lung disease, particularly, inflammatory lung disease 

10 phenotypes such as Chronic Obstructive Lung Disease (COPD), Adult 
Respiratory Distress Syndrome (ARDS), and asthma. Identification and 
characterization of such genes will make possible the development of effective 
diagnostics and therapeutic means to treat lung-related disorders. 
SUMMARY OF THE INVENTION 

15 This invention relates to Gene 216 located on human chromosome 

20p13-p12. In specific embodiments, the invention relates to isolated nucleic 
acids comprising Gene 216 genomic sequences (e.g., SEQ ID NO:5 and SEQ 
ID NO:6), cDNA sequences (e.g., SEQ ID NO:1 and SEQ ID NO:3), 
complementary sequences, sequence variants, or fragments thereof, as 

20 described herein. The present invention also encompasses nucleic acid 
probes or primers useful for assaying a biological sample for the presence or 
expression of Gene 216. The invention further encompasses nucleic acids 
variants comprising single nucleotide polymorphisms (SNPs) identified in 
several genes, including Gene 216 (e.g., SEQ ID NO:241-288). Such SNPs 

25 can be used to diagnose diseases such as asthma, or to determine a genetic 
predisposition thereto. In addition, the present invention encompasses nucleic 
acids comprising alternate splicing variants (e.g., SEQ ID NO:2 and SEQ ID 
NO:350-362). 

This invention also relates to vectors and host cells comprising vectors 
30 comprising the Gene 216 nucleic acid sequences disclosed herein. Such 
vectors can be used for nucleic acid preparations, including antisense nucleic 
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acids, and for the expression of encoded polypeptides or peptides. Host cells 
can be prokaryotic or eukaryotic cells. In specific embodiments, an expression 
vector comprises a DNA sequence encoding the Gene 216 polypeptide 
sequence (e.g., SEQ ID NO:4 or SEQ ID NO:363), sequence variants, or 
5 fragments thereof, as described herein. 

The present invention further relates to isolated Gene 216 polypeptides 
and peptides. In specific embodiments, the polypeptides or peptides comprise 
the amino acid sequence of the Gene 216 (e.g., SEQ ID NO:4 or SEQ ID 
NO:363), sequence variants, or portions thereof, as described herein. In 

10 addition, this invention encompasses isolated fusion proteins comprising Gene 
216 polypeptides or peptides. 

The present invention also relates to isolated antibodies, including 
monoclonal and polyclonal antibodies, and antibody fragments, that are 
specifically reactive with the Gene 216 polypeptides, fusion proteins, or 

15 variants, or portions thereof, as disclosed herein. In specific embodiments, 
monoclonal antibodies are prepared to be specifically reactive with the Gene 
216 polypeptide (e.g., SEQ ID NO:4 or SEQ ID NO;363) or peptides, or 
sequence variants thereof. 

In addition, the present invention relates to methods of obtaining Gene 

20 216 polynucleotides and polypeptides, variant sequences, or fragments 
thereof, as disclosed herein. Also related are methods of obtaining anti-Gene 
216 antibodies and antibody fragments. The present invention also 
encompasses methods of obtaining Gene 216 ligands, e.g., agonists, 
antagonists, inhibitors, and binding factors. Such ligands can be used as 

25 therapeutics for asthma and related diseases. 

The present invention also relates to diagnostic methods and kits 
utilizing Gene 216 (wild-type, mutant, or variant) nucleic acids, polypeptides, 
antibodies, or functional fragments thereof. Such factors can be used, for 
example, in diagnostic methods and kits for measuring expression levels of 

30 Gene 216, and to screen for various Gene 216-related diseases, especially 
asthma. In addition, the nucleic acids described herein can be used to identify 
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chromosomal abnormalities affecting Gene 216, and to identify allelic variants 
or mutations of Gene 216 in an individual or population. 

The present invention further relates to methods and therapeutics for 
the treatment of various diseases, including asthma. In various embodiments, 
5 therapeutics comprising the disclosed Gene 216 nucleic acids, polypeptides, 
antibodies, ligands, or variants, derivatives, or portions thereof, are 
administered to a subject to treat, prevent or ameliorate asthma. Specifically 
related are therapeutics comprising Gene 216 antisense nucleic acids, 
monoclonal antibodies, metalloprotease inhibitors, and gene therapy vectors. 

10 Such therapeutics can be administered alone, or in combination with one or 
more asthma treatments. 

In addition, this invention relates to non-human transgenic animals and 
cell lines comprising one or more of the disclosed Gene 216 nucleic acids, 
which can be used for drug screening, protein production, and other purposes. 

1 5 Also related are non-human knock-out animals and cell lines, wherein one or 
more endogenous Gene 216 genes (i.e., orthoiogs), or portions thereof, are 
deleted or replaced by marker genes. 

This invention further relates to methods of identifying proteins that are 
candidates for being involved in asthma (i.e., a "candidate protein"). Such 

20 proteins are identified by a method comprising: 1 ) identifying a protein in a first 
individual having the asthma phenotype; 2) identifying a protein in a second 
individual not having the asthma phenotype; and 3) comparing the protein of 
the first individual to the protein of the second individual, wherein a) the protein 
that is present in the second individual but not the first individual is the 

25 candidate protein; or b) the protein that is present in a higher amount in the 
second individual than in the first individual is the candidate protein; or c) the 
protein that is present in a lower amount in the second individual than in the 
first individual is the candidate protein. 
BRIEF DESCRIPTION OF THE FIGURES 

30 Figure 1 depicts the LOD Plot of Linkage to Asthma. 

Figure 2 depicts the LOD Plot of Linkage to BHR (PC20 <=4 mg/ml) & 
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Asthma. 

Figure 3 depicts the LOD Plot of Linkage to BHR (PC20 <=16 mg/ml) 
& Asthma 

Figure 4 depicts the LOD Plot of Linkage to High Total IgE & Asthma 
5 Figure 5 depicts the LOD Plot of Linkage to High Specific IgE & Asthma 

Figure 6 depicts the BAC/STS content contig map of human 
chromosome 20p 1 3-p 12. 

Figure 7 depicts the BAC1098L22 nucleotide sequence (SEQ JD NO:5). 

Figure 8 depicts the locations of single nucleotide polymorphisms, 
1 0 corresponding amino acid changes, and domains in the Gene 216 transcript. 
The exons of the transcript are marked from A to T and the size of each one 
is indicated. Above the exons, the 8 domains are labeled and a black bar 
represents the approximate location of each one. Underneath the black bars 
are the approximate location of the amino acid changes that have been 
15 identified. The amino acids boxed in white are the alleles that are most 
frequently observed. The nucleotides boxed in gray are the alleles that are 
most frequently observed. Single nucleotide polymorphisms are unboxed, and 
the polymorphism names appear underneath. The uterus cDNA clone does 
not contain all of Exon A, and does not contain the sequence CAG between 
20 Exon S and T. 

Figure 9 depicts alternate splice variants of Gene 216 obtained from 
lung tissue, including rt672 (SEQ ID NO:350), rt690 (SEQ ID NO:351), rt709 
(SEQ ID NO:352), rt71 1 (SEQ ID NO:353), rt713 (SEQ ID NO:354), and rt720 
(SEQ ID NO:355). 

25 Figure 10 depicts alternate splice variants of Gene 216 obtained from 

lung tissue, including rt725 (SEQ ID NO:356), rt727 (SEQ ID NO:357), rt733 
(SEQ ID NO:358), rt735 (SEQ ID NO:359), rt764 (SEQ ID NO:360), rt772 
(SEQ ID NO:361), and rt774 (SEQ ID NO:362). 

Figure 11 depicts the structure of the genomic sequence of Gene 216. 

30 Figure 12 depicts the alternate AG splice sequences at the junction of 

Intron ST and Exon T in Gene 216. 
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Figure 13 depicts the promoter region of Gene 216. The Gene 216 
promoter sequence is shown in SEQ ID NO:8; the Gene 216 enhancer 
sequence is shown in SEQ ID NO:7. 

Figure 14 depicts a dendrogram of the ADAM family members and the 
5 relationship of Gene 21 6 to ADAMs that possesses an active metalloprotease 
domain. 

Figures 15A-15C depict Northern Blots illustrating Gene 216 expression 
patterns. Figures 15A-15B show Gene 216 expression in various tissue types. 
Figure 15C shows Gene 216 expression in bronchial smooth muscle tissue. 
1 0 Figure 1 6 depicts a Dot Blot that shows Gene 21 6 expression in various 

tissue types. 

Figure 17 depicts RT-PCR analysis of Gene 216 expression in primary 
cells from lung tissue. 

Figure 18 depicts an amino acid sequence alignment (Pileup) of 5 

15 ADAM family members that are closely related to Gene 216. Amino acids 
highlighted in black show 100% identity within the Pileup; dark gray show 80% 
identity; and light gray show 60% identity. The boxed amino acids represent 
the cysteine switch, the metalloprotease domain, and the "met-turn". The 
labeled arrows show the locations of the 8 domains. 

20 Figure 19 depicts the amino acid sequence of Gene 216 (SEQ ID 

NO:4). Labeled arrows above the sequence denote domain and corresponding 
length. Black boxes represent the signal sequence and the transmembrane 
domain identified by hydrophobicity plots. The underlined cysteine residue at 
position 133 is predicted to be involved in the cysteine switch, the dashed box 

25 represents the metalloprotease domain, and the methionine underlined twice 
is the "met-turn". The gray boxes represent the signaling binding sites 
identified in the cytoplasmic tail. The amino acid changes corresponding to 
single nucleotide polymorphisms are indicated in bold. The alanine deleted in 
the uterus cDNA clone is marked within a black triangle, and if present would 

30 have been between the glutamine and the aspartic acid. 

Figure 20 depicts the Kyte-Doolittle hydrophobicity plot for the Gene 
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216 amino acid sequence. 

Figures 21 depicts the genomic sequence of the mouse ortholog of 
Gene 21 6. (SEQ ID NO:364). 

Figure 22 depicts the cDNA nucleotide sequence (SEQ ID NO:364) and 
5 predicted amino acid sequence (SEQ ID NO:365) of the mouse ortholog of 
Gene 216. 

Figure 23 depicts an amino acid sequence alignment (Pileup) of human 
Gene 216 polypeptide (SEQ ID NO:4) and the mouse ortholog of Gene 216 
(SEQ ID NO:366). Vertical lines indicate identical amino acid residues. Dots 
10 indicate similar amino acid residues. 

Figure 24 depicts the nucleotide sequence (SEQ ID NO:1) and encoded 
amino acid sequence (SEQ ID NO:4) determined from the master cDNA 
sequence of Gene 21 6. The master cDNA sequence combines the sequence 
information from the uterine cDNA clone and 5'RACE clone. Identified single 
15 nucleotide polymorphism positions are underlined. 

Figure 25 depicts the results of a case control study p-value plot that 
shows single nucleotide polymorphism association with the asthma phenotype 
in the combined US and UK populations. 

Figure 26 depicts the results of a case control study p-value plot that 
20 shows single nucleotide polymorphism association with the asthma phenotype 
in the US and UK populations, separately. 

Figure 27 depicts the results of a case control study p-value plot that 
shows single nucleotide polymorphism association with the bronchial hyper- 
responsiveness and asthma phenotypes in the US and UK combined 
25 population. 

Figure 28 depicts the results of a case contfof study p-value plot that 
shows single nucleotide polymorphism association with the bronchial hyper- 
responsiveness and asthma phenotypes in the US and UK populations, 
separately. 

30 Figure 29 depicts the genomic nucleotide sequence (SEQ ID NO:6) 

determined for Gene 216. Identified single nucleotide polymorphism positions 
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are underlined. 

Figure 30 depicts the nucleotide sequence (SEQ ID NO:3) and encoded 
amino acid sequence (SEQ ID NO: 363) of Gene 216 determined from the 
uterus cDNA clone. Identified single nucleotide polymorphism positions are 
5 underlined. 

Figure 31 depicts the nucleotide sequence (SEQ ID NO:350) and 
encoded amino acid sequence (SEQ ID NO:337) of Gene 216 alternate splice 
variant rt672. 

Figure 32 depicts the nucleotide sequence (SEQ ID NO:351) and 
1 0 encoded amino acid sequence (SEQ ID NO:338) of Gene 21 6 alternate splice 
variant rt690. 

Figure 33 depicts the nucleotide sequence (SEQ ID NO:352) and 
encoded amino acid sequence (SEQ ID NO:339) of Gene 216 alternate splice 
variant rt709. 

15 Figure 34 depicts the nucleotide sequence (SEQ ID NO:353) and 

encoded amino acid sequence (SEQ ID NO:340) of Gene 216 alternate splice 
variant rt711. 

Figure 35 depicts the nucleotide sequence (SEQ ID NO:354) and 
encoded amino acid sequence (SEQ ID NO:341 ) of Gene 216 alternate splice 
20 variant rt71 3. 

Figure 36 depicts the nucleotide sequence (SEQ ID NO:355) and 
encoded amino acid sequence (SEQ ID NO:342) of Gene 216 alternate splice 
variant rt720. 

Figure 37 depicts the nucleotide sequence (SEQ ID NO:356) and 
25 encoded amino acid sequence (SEQ ID NOr343) of Gene 216 alternate splice 
variant rt725. 

Figure 38 depicts the nucleotide sequence (SEQ ID NO:357) and 
encoded amino acid sequence (SEQ ID NO:344) of Gene 216 alternate splice 
variant rt727. 

'30 Figure 39 depicts the nucleotide sequence (SEQ ID NO:358) and 

encoded amino acid sequence (SEQ ID NO:345) of Gene 216 alternate splice 
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variant rt733. 

Figure 40 depicts the nucleotide sequence (SEQ ID NO:359) and 
encoded amino acid sequence (SEQ ID NO:346) of Gene 216 alternate splice 
variant rt735. 

5 Figure 41 depicts the nucleotide sequence (SEQ ID NO:360) and 

encoded amino acid sequence (SEQ ID NO:347) of Gene 216 alternate splice 
variant rt764. 

Figure 42 depicts the nucleotide sequence (SEQ ID NO:361) and 
encoded amino acid sequence (SEQ ID NO:348) of Gene 216 alternate splice 
10 variant rt772. 

Figure 43 depicts the nucleotide sequence (SEQ ID NO:362) and 
encoded amino acid sequence (SEQ ID NO:349) of Gene 216 alternate splice 
variant rt774. 

DETAILED DESCRIPTION OF THE INVENTION 

15 Gene 216 was identified by extensive analysis of the region of human 

chromosome 20p13~p12 associated with airway hyperresponsiveness, asthma, 
and atopy. This region has also been implicated in other diseases such as 
obesity (Wilson, 1999, Arch. Intern. Med. 159:2513-4). Bronchial asthma, 
furthermore, has been linked to intestinal conditions such as inflammatory 

20 bowel disease (B. Wallaert et al., 1995, J. Exp. Med. 182:1897-1904). Thus, 
there was a need to identify and isolate the gene(s) associated with this region 
of human chromosome 20. 
Definitions 

To aid in the understanding of the specification and claims, the following 
25 definitions are provided. 

"Disorder region" refers to a portion of the human chromosome 20 
bounded by the markers D20S502 and D20S851. A "disorder-associated" 
nucleic acid or polypeptide sequence refers to a nucleic acid sequence that 
maps to region 20p13-p12 or the polypeptides encoded therein (e.g., Gene 
30 216 nucleic acids, and polypeptides). For nucleic acids, this encompasses 
sequences that are identical or complementary to the Gene 216 sequence, as 
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well as sequence-conservative, function-conservative, and non -conservative 
variants thereof. For polypeptides, this encompasses sequences that are 
identical to the Gene 216 polypeptide, as well as function-conservative and 
non-conservative variants thereof. Included are naturally-occurring mutations 
5 of Gene 216 causative of respiratory diseases or obesity, such as but not 
limited to mutations which cause altered protein levels or stability (e.g., 
decreased levels, increased levels, expression in an inappropriate tissue type, 
increased stability, and decreased stability). 

As used herein, the "reference sequence" for Gene 216 is BAC1098L22 

10 (SEQ ID NO:5). The BAC1098L22 sequence is also the source of the 
disclosed Gene 216 genomic sequence (SEQ ID NO:6). "Variant" sequences 
refer to nucleotide sequences (and the encoded amino acid sequences) that 
differ from the reference sequence at one or more positions. Non-limiting 
examples of variant sequences include the disclosed Gene 216 single 

15 nucleotide polymorphisms (SNPs), alternate splice variants, and the amino 
acid sequences encoded by these variants. 

"Sequence-conservative" variants are those in which a change of one 
or more nucleotides in a given codon position results in no alteration in the 
amino acid encoded at that position (i.e., silent mutations). "Function- 

20 conservative" variants are those in which a change in one or more nucleotides 
in a given codon position results in a polypeptide sequence in which a given 
amino acid residue in the polypeptide has been replaced by a conservative 
amino acid substitution as described in detail herein. "Function-conservative" 
variants also include analogs of a given polypeptide and any polypeptides that 

25 have the ability to elicit antibodies specific to a designated polypeptide. "Non- 
conservative" variants are those in which a change in one or more nucleotides 
in a given codon position results in a polypeptide sequence in which a given 
amino acid residue in a polypeptide has been replaced by a non-conservative 
amino acid substitution as described hereinbelow. "Non-conservative" variants 

30 also include polypeptides comprising non-conservative amino acid 
substitutions. 
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As used herein, the term "ortholog" denotes a gene or polypeptide 
obtained from one species that has homology to an anafogous gene or 
polypeptide from a different species. The term "paralog" denotes a gene or 
polypeptide obtained from a given species that has homology to a distinct gene 
5 or polypeptide from that same species. For example, the disclosed mouse and 
human Gene 216 sequences are orthologs, whereas human Gene 216 and 
human ADAM 19 are paralogs. 

"Nucleic acid or "polynucleotide" as used herein refers to purine- and 
pyrimidine-containing polymers of any length, either polyribonucleotides or 

10 polydeoxyribonucleotide or mixed polyribo-polydeoxyribonucleotides. This 
includes single-and double-stranded molecules, i.e., DNA-DNA, DNA-RNA 
and RNA-RNA hybrids, as well as "protein nucleic acids" (PNA) formed by 
conjugating bases to an amino acid backbone. This also includes nucleic 
acids containing modified bases. 

15 As used herein, "isolated" nucleic acids are nucleic acids separated 

away from other components (e.g., DNA, RNA, and protein) with which they 
are associated (e.g., as obtained from cells, chemical synthesis systems, or 
phage or nucleic acid libraries). Isolated nucleic acids are at least 60% free, 
preferably 75% free, and most preferably 90% free from other associated 

20 components. In accordance with the present invention, isolated nucleic acids 
can be obtained by methods described herein, or other established methods, 
including isolation from natural sources (e.g., cells, tissues, or organs), 
chemical synthesis, recombinant methods, combinations of recombinant and 
chemical methods, and library screening methods. 

25 Nucleic acids referred to herein as "recombinant" are nucleic acids 

which have been produced by recombinant DNA methodology, including those 
nucleic acids that are generated by procedures which rely upon a method of 
artificial replication, such as the polymerase chain reaction (PCR) and/or 
cloning into a vector using restriction enzymes. Portions of recombinant 

30 nucleic acids which code for polypeptides can be identified and isolated by, for 
example, the method of M. Jasin et al., U.S. Patent No. 4,952,501 . 
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A "coding sequence" or a "protein-coding sequence" is a polynucleotide 
sequence capable of being transcribed into mRNA and/or capable of being 
translated into a polypeptide or peptide. The boundaries of the coding 
sequence are typically determined by a translation start codon at the 5*- 
5 terminus and a translation stop codon at the S'-terminus. 

A "complement" of a nucleic acid sequence as used herein refers to the 
"antisense" sequence that participates in Watson-Crick base-pairing with the 
original sequence. 

A "probe" or "primer" refers to a nucleic acid or oligonucleotide that 
10 forms a hybrid structure, with a sequence in a target region due to 
complementarily of the probe or primer sequence to at least one portion of the 
target region sequence. 

Nucleic acids are "hybrid izable" to each other when at least one strand 
of the nucleic acid can anneal to another nucleic acid strand under defined 
1 5 stringency conditions. Hybridization requires that the two nucleic acids contain 
substantially complementary sequences; depending on the stringency of 
hybridization, however, mismatches may be tolerated. The appropriate 
stringency for hybridizing nucleic acids depends on the length of the nucleic 
acids and the degree of complementarily, and can be determined in 
20 accordance with the methods described herein. 

As used herein, "portion" and 'fragment" are synonymous. A "portion" 
as used with regard to a nucleic acid or polynucleotide, refers to fragments of 
that nucleic acid or polynucleotide. The fragments can range in size from 8 
nucleotides to all but one nucleotide of the entire Gene 216 sequence. 
25 Preferably, The fragments are at least 8 to 10 nucleotides in length; more 
preferably at least 12 nucleotides in length; still more preferably at least 15 to 
20 nucleotides in length; yet more preferably at least 25 nucleotides in length; 
and most preferably at least 35 to 55 nucleotides in length. 

"cDNA" refers to complementary or copy DNA produced from an RNA 
30 template by the action of RNA-dependent DNA polymerase (reverse 
transcriptase). Thus, a "cDNA clone" means a duplex DNA sequence 
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complementary to an RNA molecule of interest, included in a cloning vector or 
PCR amplified. This term includes genes from which the intervening 
sequences have been removed. 

"Cloning" refers to the use of recombination techniques to insert a 
5 particular gene or other DNA sequence into a vector molecule. In order to 
successfully clone a desired gene, it is necessary to use methods for 
generating DNA fragments, for joining the fragments to vector molecules, for 
introducing the composite DNA molecule into a host cell in which it can 
replicate, and for selecting the clone having the target gene from amongst the 

10 recipient host cells. 

"cDNA library" refers to a collection of recombinant DNA molecules 
containing cDNA inserts that together comprise essentially all of the expressed 
genes of an organism. A cDNA library can be prepared by methods known to 
one skilled in the art (see, e.g., Cowell and Austin, 1997, "cDNA Library 

15 Protocols," Methods in Molecular Biology). Generally, RNA is first isolated 
from the cells of the desired organism, and the RNA is used to prepare cDNA. 
molecules. 

"Cloning vector" refers to a plasmid or phage DNA or other DNA that is 
able to replicate in a host cell. The cloning vector is typically characterized by 

20 one or more endonuclease recognition sites at which such DNA sequences 
may be cut in a determinable fashion without loss of an essential biological 
function of the DNA, which may contain a marker suitable for use in the 
identification of cells containing the vector. 

"Regulatory sequence" refers to a nucleic acid sequence that controls 

25 or regulates expression of structural genes when operably linked to those 
genes. These include, for example, the lac systemic the tip system, major 
operator and promoter regions of the phage lambda, the control region of fd 
coat protein and other sequences known to control the expression of genes in 
prokaryotic or eukaryotic cells. Regulatory sequences will vary depending on 

30 whether the vector is designed to express the operably linked gene in a 
prokaryotic or eukaryotic host, and may contain transcriptional elements such 
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as enhancer elements, termination sequences, tissue-specificity elements 
and/or translational initiation and termination sites. 

"Expression vector" refers to a vehicle or pi asm id that is capable of 
expressing a gene that has been cloned into it, after transformation or 
5 integration in a host cell. The cloned gene is usually placed under the control 
of (i.e., operably linked to) a regulatory sequence. 

"Operably linked" means that the promoter controls the initiation of 
expression of the gene. A promoter is operably linked to a sequence of 
proximal DNA if upon introduction into a host ceil the promoter determines the 

10 transcription of the proximal DNA sequence(s) into one or more species of 
RNA. A promoter is operably linked to a DNA sequence if the promoter is 
capable of initiating transcription of that DNA sequence. 

"Host" includes prokaryotes and eukaryotes. The term includes an 
organism or cell that is the recipient of an expression vector (e.g., 

1 5 autonomously replicating or integrating vector). 

"Amplification" of nucleic acids refers to methods such as polymerase 
chain reaction (PCR), ligation amplification (orligase chain reaction, LCR) and 
amplification methods based on the use of Q-beta replicase. These methods 
are well known in the art and described, for example, in U.S. Patent Nos. 

20 4,683,195 and 4,683,202. Reagents and hardware for conducting PCR are 
commercially available. Primers useful for amplifying sequences from the 
disorder region are preferably complementary to, and preferably hybridize 
specifically to, sequences in the 20p13-p12 region or in regions that flank a 
target region therein. Gene 21 6 generated by amplification may be sequenced 

25 directly. Alternatively, the amplified sequence(s) may be cloned prior to 
sequence analysis. 

"Gene" refers to a DNA sequence that encodes through its template or 
messenger RNA a sequence of amino acids characteristic of a specific peptide, 
polypeptide, or protein. The term "gene" as used herein with reference to 

30 genomic DNA includes intervening, non-coding regions, as well as regulatory 
regions, and can include 5' and 3' ends. 
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A gene sequence is "wild-type" if such sequence is usually found in 
individuals unaffected by the disease or condition of interest. However, 
environmental factors and other genes can also play an important role in the 
ultimate determination of the disease. In the context of complex diseases 
5 involving multiple genes ("oligogenic disease"), the "wild type", or normal 
sequence can also be associated with a measurable risk or susceptibility, 
receiving its reference status based on its frequency in the general population. 
As used herein, "wild-type Gene 21 6" refers to the reference sequence, 
BAC1 098L22 (SEQ ID NO:5). The wild-type Gene 216 sequence was used to 
10 identify the variants (single nucleotide polymorphisms) described in detail 
herein. 

A gene sequence is a "mutant" sequence if it differs from the 
wild-type sequence. For example, a Gene 216 nucleic acid containing a single 
nucleotide polymorphism is a mutant sequence. In some cases, the individual 

1 5 carrying such gene has increased susceptibility toward the disease or condition 
of interest. In other cases, the "mutant" sequence might also refer to a 
sequence that decreases the susceptibilty toward a disease or condition of 
interest, and thus acting in a protective manner. Also a gene is a "mutant" 
gene if too much ("overexpressed") or too little ("underexpressed") of such 

20 gene is expressed in the tissues in which such gene is normally expressed, 
thereby causing the disease or condition of interest. 

A nucleic acid or fragment thereof is "substantially homologous" to 
another if, when optimally aligned (with appropriate nucleotide insertions and/or 
deletions) with the other nucleic acid (or its complementary strand), there is 

25 nucleotide sequence identity in at least 60% of the nucleotide bases, usually 
at least 70%, more usually at least 80%, preferably at least 90%, and more 
preferably at least 95-98% of the nucleotide bases. 

Alternatively, substantial homology exists when a nucleic acid or 
fragment thereof will hybridize, under selective hybridization conditions, to 

30 another nucleic acid (or a complementary strand thereof). Selectivity of 
hybridization exists when hybridization which is substantially more selective 
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than total lack of specificity occurs. Typically, selective hybridization will occur 
when there is at least about 55% sequence identity over a stretch of at least 
about nine or more nucleotides, preferably at least about 65%, more preferably 
at least about 75%, and most preferably at least about 90% (M. Kanehisa, 
5 1984, /Vac/. Acids Res. 11:203-213). The length of homology comparison, as 
described, may be over longer stretches, and in certain embodiments will often 
be over a stretch of at least 14 nucleotides, usually at least 20 nucleotides, 
more usually at least 24 nucleotides, typically at least 28 nucleotides, more 
typically at least 32 nucleotides, and preferably at least 36 or more nucleotides. 
10 As used herein, the terms "protein" and "polypeptide" are synonymous. 

"Peptides" are defined as fragments or portions of polypeptides, preferably 
fragments or portions having at least one functional activity (e.g., proteolysis, 
adhesion, fusion, antigenic, or intracellular activity) as the complete polypeptide 
sequence. 

15 "Isolated" polypeptides or peptides are those that are separated from 

other components (e.g., DNA, RNA, and other polypeptides or peptides) with 
which they are associated (e.g., as obtained from cells, translation systems, or 
chemical synthesis systems). In a preferred embodiment, isolated 
polypeptides or peptides are at least 10% pure; more preferably, 80 or 90% 

20 pure. Isolated polypeptides and peptides include those obtained by methods 
described herein, or other established methods, including isolation from natural 
sources (e.g., cells, tissues, or organs), chemical synthesis, recombinant 
methods, or combinations of recombinant and chemical methods. Proteins or 
polypeptides referred to herein as "recombinant" are proteins or polypeptides 

25 produced by the expression of recombinant nucleic acids. 

A "portion" as used herein with regard to a protein or polypeptide, refers 
to fragments of that protein or polypeptide. The fragments can range in size 
from 5 amino acid residues to all but one residue of the entire protein 
sequence. Thus, a portion or fragment can be at least 5, 5-50, 50-100, 100- 

30 200, 200-400, 400-800, or more consecutive amino acid residues of a Gene 
216 protein or polypeptide, for example, SEQ ID NO:4 or SEQ ID NO:363. 
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An "immunogenic component", is a moiety that is capable of eliciting a 
humoral and/or cellular immune response in a host animal. 

An "antigenic component" is a moiety that binds to its specific antibody 
with sufficiently high affinity to form a detectable antigen-antibody complex. 
5 A "sample" as used herein refers to a biological sample, such as, for 

example, tissue or fluid isolated from an individual (including, without limitation, 
plasma, serum, cerebrospinal fluid, lymph, tears, saliva, milk, pus, and tissue 
exudates and secretions) or from in vitro cell culture constituents, as well as 
samples obtained from, for example, a laboratory procedure. 

10 "Antibodies" refer to polyclonal and/or monoclonal antibodies and 

fragments thereof, and immunologic binding equivalents thereof, that can bind 
to asthma proteins and fragments thereof or to nucleic acid sequences from 
the 20p13-p12 region, particularly from the asthma locus or a portion thereof. 
The term antibody is used both to refer to a homogeneous molecular entity, 

15 or a mixture such as a serum product made up of a plurality of different 
molecular entities. Proteins may be prepared synthetically in a protein 
synthesizer and coupled to a carrier molecule and injected over several months 
into rabbits. Rabbit sera is tested for immunoreactivity to the protein or 
fragment. Monoclonaf antibodies may be made by injecting mice with the 

20 proteins, or fragments thereof. Monoclonal antibodies will be screened by 
ELISA and tested for specific immunoreactivity with protein or fragments 
thereof. (Harlow et al., 1988, Antibodies: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY). These antibodies will be useful 
in assays as well as therapeutics. 

25 "Identity," as known in the art, is a relationship between two or more 

polypeptide sequences or two or more polynucleotide sequences, as 
determined by comparing the sequences. In the art, "identity" also means the 
degree of sequence relatedness between polypeptide or polynucleotide 
sequences, as the case may be, as determined by the match between strings 

30 of such sequences. "Identity" and "similarity" can be readily calculated by 
known methods, including but not limited to those described in (A.M. Lesk (ed), 
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1988, Computational Molecular Biology, Oxford University Press, NY; D.W. 
Smith (ed), 1993, Biocomputing. Informatics and Genome Projects, Academic 
Press, NY; A.M. Griffin and H.G. Griffin, H. G (eds), 1994, Computer Analysis 
of Sequence Data, Part !, Humana Press, NJ; G. von Heinje, 1987, Sequence 
5 Analysis in Molecular Biology, Academic Press; and M. Gribskov and J. 
Devereux (eds), 1991, Sequence Analysis Primer, M Stockton Press, NY; H. 
Carillo and D. Lipman, 1988, SIAM J. Applied Math., 48:1073. 

Technical and scientific terms used herein have the meanings 
commonly understood by one of ordinary skill in the art to which the present 
10 invention pertains, unless^ otherwise defined. Reference is made herein to 
various methodologies known to those of skill in the art. Publications and other 
materials setting forth such known methodologies to which reference is made 
are incorporated herein by reference in their entireties as though set forth in 
full. 

15 Standard reference works setting forth the general principles of 

recombinant DNA technology include J. Sambrook et al., 1989, Molecular 
Cloning: A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY; P.B. Kaufman et al., (eds), 1995, Handbook of 
Molecular and Cellular Methods in Biology and Medicine, CRC Press, Boca 

20 Raton; M.J. McPherson (ed), 1991, Directed Mutagenesis: A Practical 
Approach, IRL Press, Oxford; J. Jones, 1992, Amino Acid and Peptide 
Synthesis, Oxford Science Publications, Oxford; B.M. Austen and O.M.R. 
Westwood, 1991, Protein Targeting and Secretion, IRL Press, Oxford; D.N 
Glover (ed), 1985, DNA Cloning, Volumes I and II; M.J. Gait (ed), 1984, 

25 Oligonucleotide Synthesis', B.D. Hames and SJ. Higgins (eds), 1984, Nucleic 
Acid Hybridization) Wu and Grossman (eds), Methods in Enzymoloqy 
(Academic Press, Inc.), Vol. 154 and Vol. 155; Quirke and Taylor (eds), 1991 , 
PCR-A Practical Approach', Hames and Higgins (eds), 1984, Transcription and 
Translation; R.I. Freshney (ed), 1986, Animal Cell Culture; Immobilized Cells 

30 and Enzymes, 1986, IRL Press; Perbai, 1984, A Practical Guide to Molecular 
Cloning; J. H. Miller and M. P. Calos (eds), 1987, Gene Transfer Vectors for 
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Mammalian Cells, Cold Spring Harbor Laboratory Press; MJ. Bishop (ed), 
1998, Guide to Human Genome Computing, 2d Ed., Academic Press, San 
Diego, CA; L.F. Peruski and A.H. Peruski, 1997, The Internet and the New 
Biology: Tools for Genomic and Molecular Research, American Society for 
5 Microbiology, Washington, D.C. 

Standard reference works setting forth the general principles of 
immunology include S. Sell, 1996, Immunology, Immunopathology & Immunity, 
5th Ed., Appleton & Lange, Publ., Stamford, CT; D. Male et a!., 1996, 
Advanced Immunology, 3d Ed., Times Mirror Int'l Publishers Ltd., Publ., 

10 London; D.P. Stites and A.I. Terr, 1991, Basic and Clinical Immunology, 7th 
Ed., Appleton & Lange, Publ., Norwalk, CT; and A.K. Abbas et al., 1991, 
Cellular and Molecular Immunology, W. B. Saunders Co., Publ., Philadelphia, 
PA. Any suitable materials and/or methods known to those of skill can be 
utilized in carrying out the present invention; however, preferred materials 

15 and/or methods are described. Materials, reagents, and the like to which 
reference is made in the following description and examples are generally 
obtainable from commercial sources, and specific vendors are cited herein. 
Nucleic Acids 

The present invention relates to isolated Gene 216 nucleic acids 
20 comprising genomic DNA within BAC RPCM098L22 (e.g., SEQ ID NO:5), the 
corresponding cDNA sequences (e.g., SEQ ID NO:1 or SEQ ID NO:3), RNA, 
fragments of the genomic, cDNA, or RNA nucleic acids comprising 20, 40, 60, 
100, 200, 500 or more contiguous nucleotides, and the complements thereof. 
Closely related variants are also included as part of this invention, as well as 
25 nucleic acids sharing at least 50, 60, 70, 80, or 90% identity with the nucleic 
acids described above, and nucleic acids which would be identical to a Gene 
216 nucleic acids except for one or a few substitutions, deletions, or additions. 

The invention also relates to isolated nucleic acids comprising regions 
required for accurate expression of Gene 216 (e.g., Gene 216 promoter (e.g., 
30 SEQ ID NO:8), enhancer (e.g., SEQ ID NO:7), and polyadenylation 
sequences). In a preferred embodiment, the present invention is directed to 
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at least 15 contiguous nucleotides of the nucleic acid sequence of SEQ ID 
NO:1 or SEQ ID NO:6. More particularly, embodiments of this invention 
include the BAC clone containing segments of Gene 216 including 
RPCM098L22 as set forth in SEQ ID NO:5 (Figure 7). 
5 The invention further relates to nucleic acids (e.g., DNA or RNA) that 

hybridize to a) a nucleic acid encoding a Gene 216 polypeptide, such as a 
nucleic acid having the sequence of SEQ ID NO:1 or SEQ ID NO:6; b) 
sequence-conservative, function-conservative, and non-conservative variants 
of (a); and c) fragments or portions of (a) or (b). Nucleic acids that hybridize 

10 to the sequence of SEQ ID NO:1 or SEQ ID NO:6 can be double- or single- 
stranded. Hybridization to the sequence of SEQ ID NO:1 or SEQ ID NO:6 
includes hybridization to the strand shown or its complementary strand. 

The present invention also relates to nucleic acids that encode a 
polypeptide having the amino acid sequence of SEQ ID NO:4 or SEQ ID 

1 5 NO:363, or functional equivalents thereof. A functional equivalent of a Gene 
216 protein includes fragments or variants that perform at least on 
characteristic function of the Gene 216 protein (e.g., proteolysis, adhesion, 
fusion, antigenic, or intracellular activity). Preferably, a functional equivalent 
will share at least 65% sequence identity with the Gene 216 polypeptide. 

20 In preferred embodiments, nucleic acids of the present invention share 

at least 50%, preferably at least 60-70%, more preferably at least 70-80% 
sequence identity, and even more preferably at least 90-100% sequence 
identity with the sequences of SEQ ID NO:1 or SEQ ID NO:6, or fragments or 
portions thereof. Sequence identity calculations can be performed using 

25 computer programs, hybridization methods, or calculations. Preferred 
computer program methods to determine identity and similarity between two 
sequences include, but are not limited to, the GCG program package, 
BLASTN, BLASTX, TBLASTX, and FASTA (J. Devereuxet al., 1984, Nucleic 
Acids Research 12(1):387; S.F. Altschul et al., 1990, J. Molec. Biol. 215:403- 

30 410; W. Gish and D.J. States,-1994, Nature Genet 3:266-272; W.R. Pearson 
and D.J. Lipman, 1988, ProcNatL Acad. Sci. USA 85(8):2444-8). The BLAST 
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programs are publicly available from NCB1 and other sources . The well-known 
Smith Waterman algorithm may also be used to determine identity. 

For example, nucleotide sequence identity can be determined by 
comparing a query sequences to sequences in publicly available sequence 
5 databases (NCBI) using the BLASTN2 algorithm (S.F. Altschul et al., 1997, 
Nucl. Acids Res., 25:3389-3402). The parameters for a typical search are: E 
= 0.05, v = 50, B = 50, wherein E is the expected probability score cutoff, V is 
the number of database entries returned in the reporting of the results, and B 
is the number of sequence alignments returned in the reporting of the results 

10 (S.F. Altschul et aL, 1990, J. Mol. Biol, 215:403-410). 

In another approach, nucleotide sequence identity can be calculated 
using the following equation: % identity = (number of identical nucleotides) / 
(alignment length in nucleotides) * 100. For this calculation, alignment length 
includes internal gaps but not includes terminal gaps. Alternatively, nucleotide 

15 sequence identity can be determined experimentally using the specific 
hybridization conditions described below. 

In accordance with the present invention, polynucleotide alterations are 
selected from the group consisting of at least one nucleotide deletion, 
substitution, including transition and transversion, insertion, or modification 

20 (e.g., via RNA or DNA analogs). Alterations may occur at the 5' or 3' terminal 
positions of the reference nucleotide sequence or anywhere between those 
terminal positions, interspersed either individually among the nucleotides in the 
reference sequence or in one or more contiguous groups within the reference 
sequence. Alterations of a polynucleotide sequence of SEQ ID NO:1 or SEQ 

25 ID NO:6 may create nonsense, missense, or frameshift mutations in this coding 
sequence, and thereby alter the polypeptide encoded by the polynucleotide 
following such alterations. 

Such altered nucleic acids, including DNA or RNA, can be detected and 
isolated by hybridization under high stringency conditions or moderate 

30 stringency conditions, for example, which are chosen to prevent hybridization 
of nucleic acids having non-complementary sequences. "Stringency 
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conditions" for hybridizations is a term of art which refers to the conditions of 
temperature and buffer concentration which permit hybridization of a particular 
nucleic acid to another nucleic acid in which the first nucleic acid may be 
perfectly complementary to the second, or the first and second may share 
5 some degree of complementarity which is less than perfect. 

For example, certain high stringency conditions can be used which 
distinguish perfectly complementary nucleic acids from those of less 
complementarity. "High stringency conditions" and "moderate stringency 
conditions" for nucleic acid hybridizations are explained in F.M. Ausubel et al. 

1 0 (eds), 1 995, Current Protocols in Molecular Biology, John Wiley and Sons, Inc., 
New York, NY, the teachings of which are hereby incorporated by reference. 
In particular, see pages 2.10.1-2.10.16 (especially pages 2.10.8-2.10.11) and 
pages 6.3.1-6.3.6. The exact conditions which determine the stringency of 
hybridization depend not only on ionic strength, temperature and the 

1 5 concentration of destabilizing agents such as formamide, but also on factors 
such as the length of the nucleic acid sequence, base composition, percent 
mismatch between hybridizing sequences and the frequency of occurrence of 
subsets of that sequence within other non-identical sequences. Thus, high or 
moderate stringency conditions can be determined empirically. 

20 By varying hybridization conditions from a level of stringency at which 

no hybridization occurs to a level at which hybridization is first observed, 
conditions which will allow a given sequence to hybridize with the most similar 
sequences in the sample can be determined. Preferably the hybridizing 
sequences will have 60-70% sequence identity, more preferably 70-85% 

25 sequence identity, and even more preferably 90-100% sequence identity. 

Typically, the hybridization reaction is initially performed under 
conditions of low stringency, followed by washes of varying, but higher 
stringency. Reference to hybridization stringency, e.g., high, moderate, or low 
stringency, typically relates to such washing conditions. Hybridization 

30 conditions are based on the melting temperature (T m ) of the nucleic acid probe 
or primer and are typically classified by degree of stringency of the conditions 
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- under which hybridization is measured (Ausubel et al., 1995). For example, 
high stringency hybridization typically occurs at about 5-10% C below the T m ; 
moderate stringency hybridization occurs at about 10-20% below the T m ; and 
low stringency hybridization occurs at about 20-25% below the Tm. The melting 
5 temperature can be approximated by the formulas as known in the art, 
depending on a number of parameters, such as the length of the hybrid or 
probe in number of nucleotides, or hybridization buffer ingredients and 
conditions. As a general guide, T m decreases approximately 1°C with every 
1 % decrease in sequence identity at any given SSC concentration. Generally, 

1 0 doubling the concentration of SSC results in an increase in T m of ~1 7°C. Using 
these guidelines, the washing temperature can be determined empirically for 
moderate or low stringency, depending on the level of mismatch sought. 

High stringency hybridization conditions are typically carried out at 65 
to 68°C in 0.1 X SSC and 0.1% SDS. Highly stringent conditions allow 

15 hybridization of nucleic acid molecules having about 95 to 100% sequence 
identity. Moderate stringency hybridization conditions are typically carried out 
at 50 to 65°C in 1 X SSC and 0.1% SDS. Moderate stringency conditions allow 
hybridization of sequences having at least about 80 to 95% nucleotide 
sequence identity. Low stringency hybridization conditions are typically carried 

20 out at 40 to 50°C in 6 X SSC and 0.1% SDS. Low stringency hybridization 
conditions allow detection of specific hybridization of nucleic acid molecules 
having at least about 50 to 80% nucleotide sequence identity. 

For example, high stringency conditions can be attained by hybridization 
in 50% formamide, 5 X Denhardt's solution, 5 X SSPE or SSC (1 X SSPE 

25 buffer comprises 0.15 M NaCI, 10 mM Na 2 HP0 4 , 1 mM EDTA; 1 X SSC buffer 
comprises 150 mM NaCI, 15 mM sodium citrate, pH 7.0), 0.2% SDS at about 
42°C, followed by washing in 1 X SSPE or SSC and 0.1% SDS at a 
temperature of at least about 42°C, preferably about 55°C, more preferably 
about 65°C. Moderate stringency conditions can be attained, for example, by 

30 hybridization in 50% formamide, 5 X Denhardt's solution, 5 X SSPE or SSC, 
and 0.2% SDS at 42°C to about 50°C, followed by washing in 0.2 X SSPE or 
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SSC and 0.2% SDS at a temperature of at least about 42°C, preferably about 
55°C, more preferably about 65°C. Low stringency conditions can be attained, 
for example, by hybridization in 10% formamide, 5 X Denhardfs solution, 6 X 
SSPE or SSC, and 0.2% SDS at 42°C, followed by washing in 1 X SSPE or 
5 SSC, and 0.2% SDS at a temperature of about 45°C, preferably about 50°C 
in 4 X SSC at 60°C for 30 min. 

High stringency hybridization procedures typically (1) employ low ionic 
strength and high temperature for washing, such as 0.015 M NaCl/ 0.0015 M 
sodium citrate, pH 7.0 (0.1 X SSC) with 0.1% sodium dodecyl sulfate (SDS) at 

10 50°C; (2) employ during hybridization 50% (vol/vol) formamide with 5 X 
Denhardfs solution (0.1% weight/volume highly purified bovine serum 
albumin/0.1% wt/vol Ficoli/0.1% wt/vol polyvinylpyrrolidone), 50 mM sodium 
phosphate buffer at pH 6.5 and 5 X SSC at 42°C; or (3) employ hybridization 
with 50% formamide, 5 X SSC, 50 mM sodium phosphate (pH 6.8), 0.1% 

1 5 sodium pyrophosphate, 5 X Denhardfs solution, sonicated salmon sperm DNA 
(50 fxg/ml), 0,1 % SDS, and 10% dextran sulfate at 42°C, with washes at 42°C 
in 0.2 X SSC and 0.1% SDS. 

In one particular embodiment, high stringency hybridization conditions 
may be attained by: 

20 - Prehybridization treatment of the support (e.g. nitrocellulose filter 

or nylon membrane), to which is bound the nucleic acid capable of hybridizing 
with any of the sequences of the invention, is carried out at 65°C for 6 hr with 
a solution having the following composition: 4 X SSC, 10 X Denhardfs (1 X 
Denhardfs comprises 1% Ficoll, 1% polyvinylpyrrolidone, 1% BSA (bovine 

25 serum albumin); 1 X SSC comprises of 0.15 M of NaCl and 0.015 M of sodium 
citrate, pH 7); 

Replacement of the pre-hybridization solution in contact with the 
support by a buffer solution having the following composition: 4 X SSC, 1 X 
Denhardfs, 25 mM NaPQ 4 , pH 7, 2 mM EDTA, 0.5% SDS, 100 ug/ml of 
30 sonicated salmon sperm DNA containing a nucleic acid derived from the 
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sequences of the invention as probe, in particular a radioactive probe, and 
previously denatured by a treatment at 100°C for 3 min; 
Incubation for 12 hr at 65°C; 
Successive washings with the following solutions: 1 ) four washings with 
5 2 X SSC, 1 X Denhardt's, 0.5% SDS for 45 min at 65°C; 2) two washings with 
0.2 X SSC, 0.1 X SSC for 45 min at 65°C; and 3) 0.1 x SSC, 0.1% SDS for 45 
min at 65°C. 

Additional examples of high, medium, and low stringency conditions can 
be found in Sambrook et al., 1989. Exemplary conditions are also described 
10 in M.H. Krause and S.A. Aaronson, 1991, Methods in Enzymology, 200:546- 
556; Ausubel et al., 1995. It is to be understood that the low, moderate and 
high stringency hybridization/washing conditions may be varied using a variety 
of ingredients, buffers, and temperatures well known to and practiced by the 
skilled practitioner. 

1 5 Isolated nucleic acids that are characterized by their ability to hybridize 

to (a) a nucleic acid encoding a Gene 216 polypeptide, such as the nucleic 
acids depicted as SEQ ID NO:1 or SEQ ID NO:6, b) the complement of (a), (c) 
or a portion of (a) or (b) (e.g., under high or moderate stringency conditions), 
may further encode a protein or polypeptide having at least one function 

20 characteristic of a Gene 21 6 polypeptide, such as proteolysis, adhesion, fusion, 
and intracellular activity, or binding of antibodies that also bind to non- 
recombinant Gene 216 protein or polypeptide. The catalytic or binding function 
of a protein or polypeptide encoded by the hybridizing nucleic acid may be 
detected by standard enzymatic assays for activity or binding (e.g., assays that 

25 measure the binding of a transit peptide or a precursor, or other components 
of the translocation machinery). Enzymatic assays, dompfementation tests, or 
other suitable methods can also be used in procedures for the identification 
and/or isolation of nucleic acids which encode a polypeptide having the amino 
acid sequence of SEQ ID NO:4 or SEQ ID NO:363, or a functional equivalent 

30 of this polypeptide. The antigenic properties of proteins or polypeptides 
encoded by hybridizing nucleic acids can be determined by immunological 
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methods employing antibodies that bind to a Gene 216 polypeptide such as 
immunoblot, immu no precipitation and radioimmunoassay. PCR methodology, 
including RAGE (Rapid Amplification of Genomic DNA Ends), can also be used 
to screen for and detect the presence of nucleic acids which encode Gene 
5 216-like proteins and polypeptides, and to assist in cloning such nucleic acids 
from genomic DNA. PCR methods for these purposes can be found in M.A. 
Innis et al., 1990, PCR Protocols: A Guide to Methods and Applications, 
Academic Press, Inc., San Diego, CA., incorporated herein by reference. 

It is understood that, as a result of the degeneracy of the genetic code, 

10 many nucleic acid sequences are possible which encode a Gene 216-like 
protein or polypeptide. Some of these will share little identity to the nucleotide 
sequences of any known or naturally-occurring Gene 216-like gene but can be 
used to produce the proteins and polypeptides of this invention by selection of 
combinations of nucleotide triplets based on codon choices. Such variants, 

1 5 while not hybridizabie to a naturally-occurring Gene 21 6 gene under conditions 
of high stringency, are contemplated within this invention. 

Also encompassed by the present invention are alternate splice variants 
produced by differential processing of the primary transcript(s) from Gene 216 
genomic DNA. An alternate splice variant may comprise, for example, the 

20 sequence of any one of SEQ ID NO:2 and SEQ ID NO:350-362. Alternate 
splice variants can also comprise other combinations of introns/exons of SEQ 
ID NO:1 or SEQ ID NO:6, which can be determined by those of skill in the art. 
Alternate splice variants can be determined experimentally, for example, by 
isolating and analyzing cellular RNAs (e.g., Southern blotting or PCR), or by 

25 screening cDNA libraries using the Gene 216 nucleic acid probes or primers 
described herein. In another approach, alternate splice variants can be 
predicted using various methods, computer programs, or computer systems 
available to practitioners in the field. 

General methods for splice site prediction can be found in Nakata, 1985, 

30 Nucleic Acids Res. 13:5327-5340. In addition, splice sites can be predicted 
using, for example, the GRAIL™ (E.C. Uberbacher and R.J. Mural, 1991 , Proc. 
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Nat/. Acad. Sci. USA, 88:11261-11265; E.C. Uberbacher, 1995, Trends 
Biotech., 13:497-500; http://grail.lsd.ornl.gov/graiiexp); GenView (L. Milanesi 
et al., 1993, Proceedings of the Second International Conference on 
Bioinformatics, Supercomputing, and Complex Genome Analysis, H.A. Lim et 
5 al. (eds), World Scientific Publishing, Singapore, pp. 573-588; 
http.7/125.itba.mi.cnr.iiy~webgene/wwwgene_help.html); SpliceView (http://www. 
itba.mi.cnr.it/webgene); and HSPL (V.V. Solovyev et a!., 1994, Nucleic Acids 
Res. 22:5156-5163; V.V. Solovyev et al., 1994, "The Prediction of Human 
Exons by Oligonucleotide Composition and Discriminant Analysis of Spliceable 

10 Open Reading Frames," R. Altman et al. (eds), The Second International 
conference on Intelligent systems for Molecular Biology, AAAl Press, Menlo 
Park, CA, pp. 354-362; V.V. Solovyev et ah, 1993, "Identification Of Human 
Gene Functional Regions Based On Oligonucleotide Composition," L. Hunter 
et al. (eds), In Proceedings of First International conference on Intelligent 

1 5 System for Molecular Biology, Bethesda, pp. 371 -379) computer systems. 

Additionally, computer programs such as GeneParser (E.E. Snyder and 
G.D. Stormo, 1995, J. MoL Biol. 248: 1-18; E.E. Snyder and G.D. Stormo, 
1993, Nucl. Acids Res. 21(3): 607-613; http://mcdb.colorado.edu/~eesnyder/ 
GeneParser.html); MZEF (M.Q. Zhang, 1997, Proc. Natl. Acad. ScL USA, 

20 94:565-568; http://argon.cshl.org/genefinder); MORGAN (S. Salzberg et al., 
1998, J. Comp. Biol. 5:667-680; S. Salzberg et al. (eds), 1998, Computational 
Methods in Molecular Biology, Elsevier Science, New York, NY, pp. 187-203); 
VEIL (J. Henderson et al., 1997, J. Comp. Biol. 4:127-141); GeneScan (S. 
Tiwari et al., 1997, CABIOS (Bioinformatics) 13: 263-270); GeneBuiider (L. 

25 Milanesi et al., 1999, Bioinformatics 15:612-621); Eukaryotic GeneMark (J. 
Besemer et al., 1999, Nucl. Acids Res. 27:3911-3920); and FEXH (V.V. 
Solovyev et al., 1994, Nucleic Acids Res. 22:5156-5163). In addition, splice 
sites (i.e., former or potentia) splice sites) in cDNA sequences can be predicted 
using, for example, the RNASPL (V.V. Solovyev et al., 1994, Nucleic Acids 

30 Res. 22:5156-5163); or INTRON (A. Globeketal., 1991, INTRON version 1.1 
manual, Laboratory of Biochemical Genetics, NIMH, Washington, D.C.) 
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programs. 

The present invention also encompasses naturally-occurring 
polymorphisms of Gene 216. As will be understood by those in the art, the 
genomes of all organisms undergo spontaneous mutation in the course of their 
5 continuing evolution generating variant forms of gene sequences (Gusella, 

1986, Ann. Rev. Biochem. 55:831-854). Restriction fragment length 
polymorphisms (RFLPs) include variations in DNA sequences that alter the 
length of a restriction fragment in the sequence (Botstein et al., 1980, Am. J. 
Hum. Genet 32, 314-331 (1980). RFLPs have been widely used in human 

1 0 and animal genetic analyses (see WO 90/1 3668; WO90/1 1 369; Donis-Keller, 

1987, Cell 51:319-337; Lander et al., 1989, Genetics 121: 85-99). Short 
tandem repeats (STRs) include tandem dh tri- and tetra nucleotide repeated 
motifs, also termed variable number tandem repeat (VNTR) polymorphisms. 
VNTRs have been used in identity and paternity analysis (U.S. Pat. No. 

15 5,075,217; Armour et al., 1992, FEBS Lett 307:113-115; Horn et al., WO 
91/14003; Jeffreys, EP 370,719), and in a large number of genetic mapping 
studies. 

Single nucleotide polymorphisms (SNPs) are far more frequent than 
RFLPS, STRs, and VNTRs. SNPs may occur in protein coding (e.g., exon), or 

20 non-coding (e.g., intron, 5'UTR, 3'UTR) sequences. SNPs in protein coding 
regions may comprise silent mutations that do not alter the amino acid 
sequence of a protein. Alternatively, SNPs in protein coding regions may 
produce conservative or non-conservative amino acid changes, described in 
detail below. In some cases, SNPs may give rise to the expression of a 

25 defective or other variant protein and, potentially, a genetic disease. SNPs 
within protein-coding sequences can give rise to genetic diseases, for example, 
in the (3-globin (sickle cell anemia) and CFTR (cystic fibrosis) genes. In non- 
coding sequences, SNPs may also result in defective protein expression (e.g., 
as a result of defective splicing). Other single nucleotide polymorphisms have 

30 no phenotypic effects. 

Single nucleotide polymorphisms can be used in the same manner as 
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RFLPs and VNTRs, but offer several advantages. Single nucleotide 
polymorphisms tend to occur with greater frequency and are typically spaced 
more uniformly throughout the genome than other polymorphisms. Also, 
different SNPs are often easier to distinguish than other types of 
5 polymorphisms (e.g., by use of assays employing allele-specific hybridization 
probes or primers). In one embodiment of the present invention, a Gene 216 
nucleic acid contains at least one SNP as set forth in Table 10, herein below. 
Various combinations of these SNPs are also encompassed by the invention. 
In a preferred aspect, a Gene 216 SNP is associated with a lung-related 

10 disorder, such as asthma. 

The nucleic acid sequences of the present invention may be derived 
from a variety of sources including DNA, cDNA, synthetic DNA, synthetic RNA, 
or combinations thereof. Such sequences may comprise genomic DNA, which 
may or may not include naturally occurring introns. Moreover, such genomic 

15 DNA may be obtained in association with promoter regions or poly (A) 
sequences. The sequences, genomic DNA, or cDNA may be obtained in any 
of several ways. Genomic DNA can be extracted and purified from suitable 
cells by means well known in the art. Alternatively, mRNA can be isolated from 
a cell and used to produce cDNA by reverse transcription or other means. 

20 The nucleic acids described herein are used in the methods of the 

present invention for production of proteins or polypeptides, through 
incorporation into cells, tissues, or organisms. In one embodiment, DNA 
containing all or part of the coding sequence for a Gene 216 polypeptide, or 
DNA which hybridizes to DNA having the sequence SEQ ID NO:1 or SEQ ID 

25 NO:6, is incorporated into a vector for expression of the encoded polypeptide 
in suitable host cefls. The encoded polypeptide consisting of Gene 216, or its 
functional equivalent is capable of normal activity, such as proteolysis, 
adhesion, fusion, and intracellular activity. 

The invention also concerns the use of the nucleotide sequence of the 

30 nucleic acids of this invention to identify DNA probes for Gene 216 genes, PCR 
primers to amplify Gene 216 genes, nucleotide polymorphisms in Gene 216 
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genes, and regulatory elements of the Gene 216 genes. 

The nucleic acids of the present invention find use as primers and 
templates for the recombinant production of disorder-associated peptides or 
polypeptides, for chromosome and gene mapping, to provide antisense 
5 sequences, for tissue distribution studies, to locate and obtain full length 
genes, to identify and obtain homologous sequences (wild-type and mutants), 
and in diagnostic applications. 

Probes may also be used for the detection of Gene 216-related 
sequences, and should preferably contain at least 50%, preferably at least 

10 80%, identity to Gene 216 polynucleotide, or a complementary sequence, or 
fragments thereof. The probes of this invention may be DNA or RNA, the 
probes may comprise all or a portion of the nucleotide sequence of SEQ ID 
NO:1 or SEQ ID NO:6, or a complementary sequence thereof, and may include 
promoter, enhancer elements, and introns of the naturally occurring Gene 216 

15 polynucleotide. 

The probes and primers based on the Gene 216 gene sequences 
disclosed herein are used to identify homologous Gene 216 gene sequences 
and proteins in other species. These Gene 216 gene sequences and proteins 
are used in the diagnostic/prognostic, therapeutic and drug-screening methods 

20 described herein for the species from which they have been isolated. 
Vectors and Host Cells 

The invention also provides vectors comprising the disorder-associated 
sequences, or derivatives or fragments thereof, and host cells for the 
production of purified proteins. A large number of vectors, including bacterial, 

25 yeast, and mammalian vectors, have been described for replication and/or 
expression in various host cells or cell-free systems, and may be used for gene 
therapy as well as for simple cloning or protein expression. 

In one aspect, an expression vectors comprises a nucleic acid encoding 
a Gene 216 polypeptide or peptide, as described herein, operably linked to at 

30 least one regulatory sequence. Regulatory sequences are known in the art 
and are selected to direct expression of the desired protein in an appropriate 
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host cell. Accordingly, the term regulatory sequence includes promoters, 
enhancers and other expression control elements (see D.V. Goeddel (1990) 
Methods Enzymol. 185:3-7). Enhancer and other expression control 
sequences are described in Enhancers and Eukaryotic Gene Expression, Cold 
5 Spring Harbor Press, Cold Spring Harbor, NY (1983). It should be understood 
that the design of the expression vector may depend on such factors as the 
choice of the host cell to be transfected and/or the type of polypeptide desired 
to be expressed. 

Several regulatory elements (e.g., promoters) have been isolated and 

10 shown to be effective in the transcription and translation of heterologous 
proteins in the various hosts. Such regulatory regions, methods of isolation, 
manner of manipulation, etc. are known in the art. Non-limiting examples of 
bacterial promoters include the p-lactamase (penicillinase) promoter; lactose 
promoter; tryptophan (trp) promoter; araBAD (arabinose) operon promoter; 

15 lambda-derived promoter and N gene ribosome binding site; and the hybrid 
tac promoter derived from sequences of the trp and lac UV5 promoters. Non- 
limiting examples of yeast promoters include the 3-phosphoglycerate kinase 
promoter, glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter, 
galactokinase (GAL1) promoter, galactoepimerase promoter, and alcohol 

20 dehydrogenase (ADH1) promoter. Suitable promoters for mammalian cells 
include, without limitation, viral promoters, such as those from Simian Virus 40 
(SV40), Rous sarcoma virus (RSV), adenovirus (ADV), and bovine papilloma 
virus (BPV). Preferred replication and inheritance systems include M13, 
ColE1, SV40, baculovirus, lambda, adenovirus, CEN ARS, 2jj,m ARS and the 

25 like. While expression vectors may replicate autonomously, they may also 
replicate by being inserted into the genome of the Host cell, by methods well 
known in the art. 

To obtain expression in eukaryotic cells, terminator sequences, 
polyadenylation sequences, and enhancer sequences that modulate gene 
30 expression may be required. Sequences that cause amplification of the gene 
may also be desirable. These sequences are well known in the art. 
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Furthermore, sequences that facilitate secretion of the recombinant product 
from cells, including, but not limited to, bacteria, yeast, and animal cells, such 
as secretory signal sequences and/or preprotein or proprotein sequences, may 
also be included. Such sequences are well described in the art. 
5 Expression and cloning vectors will likely contain a selectable marker, 

a gene encoding a protein necessary for survival or growth of a host cell 
transformed with the vector. The presence of this gene ensures growth of only 
those host cells that express the inserts. Typical selection genes encode 
proteins that 1 ) confer resistance to antibiotics or other toxic substances, e.g. 

10 ampicillin, neomycin, methotrexate, etc.; 2) complement auxotrophic 
deficiencies, or 3) supply critical nutrients not available from complex media, 
e.g., the gene encoding D-alanine racemase for Bacilli. Markers may be an 
inducible or non-inducible gene and will generally allow for positive selection. 
Non-limiting examples of markers include the ampicillin resistance marker (i.e., 

15 beta-lactamase), tetracycline resistance marker, neomycin/kanamycin 
resistance marker (i.e., neomycin phosphotransferase), dihydrofolate 
reductase, glutamine synthetase, and the like. The choice of the proper 
selectable marker will depend on the host cell, and appropriate markers for 
different hosts as understood by those of skill in the art. 

20 Suitable expression vectors for use with the present invention include, 

but are not limited to, pUC, pBluescript (Stratagene), pET (Novagen, Inc., 
Madison, Wl), and pREP (Invitrogen) plasmids. Vectors can contain one or 
more replication and inheritance systems for cloning or expression, one or 
more markers for selection in the host, e.g. antibiotic resistance, and one or 

25 more expression cassettes. The inserted coding sequences can be 
synthesized by standard methods, isolated from natural sources, or prepared 
as hybrids. Ligation of the coding sequences to transcriptional regulatory 
elements (e.g., promoters, enhancers, and/or insulators) and/or to other amino 
acid encoding sequences can be carried out using established methods. 

30 Suitable cell-free expression systems for use with the present invention 

include, without limitation, rabbit reticulocyte lysate, wheat germ extract, canine 
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pancreatic microsomal membranes, E. coli S30 extract, and coupled 
transcription/translation systems (Promega Corp., Madison, Wl). These 
systems allow the expression of recombinant polypeptides or peptides upon 
the addition of cloning vectors, DNA fragments, or RNA sequences containing 
5 protein-coding regions and appropriate promoter elements. 

Non-limiting examples of suitable host cells include bacteria, archea, 
insect, fungi (e.g., yeast), plant, and animal cells (e.g., mammalian, especially 
human). Of particular interest are Escherichia cofi, Bacillus subtilis, 
Saccharomyces cerevisiae, SF9 cells, C129 cells, 293 cells, Neurospora, and 

10 immortalized mammalian myeloid and lymphoid cell lines. Techniques for the 
propagation of mammalian cells in culture are well-known (see, Jakoby and 
Pastan (eds), 1979, Cell Culture. Methods in Enzymology, volume 58, 
Academic Press, Inc., Harcourt Brace Jovanovich, NY). Examples of 
commonly used mammalian host cell lines are VERO and HeLa cells, CHO 

1 5 cells, and WI38, BHK, and COS cell lines, although it wiJl be appreciated by the 
skilled practitioner that other cell lines may be used, e.g., to provide higher 
expression desirable glycosylation patterns, or other features. 

Host cells can be transformed, transfected, or infected as appropriate 
by any suitable method including electroporation, calcium chloride-, lithium 

20 chloride-, lithium acetate/polyethylene glycol-, calcium phosphate-, DEAE- 
dextran-, J/posome-mediated DNA uptake, spheroplast/ng, injection, 
microinjection, microprojectile bombardment, phage infection, viral infection, 
or other established methods. Alternatively, vectors containing the nucleic 
acids of interest can be transcribed in vitro, and the resulting RNA introduced 

25 into the host cell by well-known methods, e.g., by injection (see, Kubo et al., 
1988, FEBS Letts. 241:119). The cells into which have been introduced 
nucleic acids described above are meant to also include the progeny of such 
cells. 

The nucleic acids of the invention may be isolated directly from cells. 
30 Alternatively, the polymerase chain reaction (PCR) method can be used to 
produce the nucleic acids of the invention, using either RNA (e.g., mRNA) or 
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DNA (e.g., genomic DNA) as templates. Primers used for PCR can be 
synthesized using the^sequence information provided herein and can further 
be designed to introduce appropriate new restriction sites, if desirable, to 
facilitate incorporation into a given vector for recombinant expression. 
5 Using the information provided in SEQ ID NO:1 and SEQ ID NO:6, one 

skilled in the art will be able to clone and sequence all representative nucleic 
acids of interest, including nucleic acids encoding complete protein-coding 
sequences. It is to be understood that non-protein-coding sequences 
contained within SEQ ID NO:1 and SEQ ID NO:3 and the genomic sequences 

10 of SEQ ID NO:6 and SEQ ID NO:5 are also within the scope of the invention. 
Such sequences include, without limitation, sequences important for 
replication, recombination, transcription, and translation. Non-limiting 
examples include promoters and regulatory binding sites involved in regulation 
of gene expression, and 5'- and 3'- untranslated sequences (e.g., ribosome- 

1 5 binding sites) that form part of mRNA molecules. 

The nucleic acids of this invention can be produced in large quantities 
by replication in a suitable host cell. Natural or synthetic nucleic acid 
fragments, comprising at least ten contiguous bases coding for a desired 
peptide or polypeptide can be incorporated into recombinant nucleic acid 

20 constructs, usually DNA constructs, capable of introduction into and replication 
in a prokaryotic or eukaryotic cell. Usually the nucleic acid constructs will be 
suitable for replication in a unicellular host, such as yeast or bacteria, but may 
also be intended for introduction to (with and without integration within the 
genome) cultured mammalian or plant or other eukaryotic cells, cell lines, 

25 tissues, or organisms. The purification of nucleic acids produced by the 
methods of the present invention is described, for example, in Sambrook et al., 
1989; F.M. Ausubel et al., 1992, Current Protocols in Molecular Biology, J. 
Wiley and Sons, New York, NY. 

The nucleic acids of the present invention can also be produced by 

30 chemical synthesis, e.g., by the phosphoramidite method described by 
Beaucage et al., 1981, Tetra. Letts. 22:1859-1862, or the triester method 
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according to Matteucci et al., 1981, J. Am. Chem. Soc, 103:3185, and can 
performed on commercial, automated oligonucleotide synthesizers. A double- 
stranded fragment may be obtained from the single-stranded product of 
chemical synthesis either by synthesizing the complementary strand and 
5 annealing the strands together under appropriate conditions or by adding the 
complementary strand using DNA polymerase with an appropriate primer 
sequence. 

These nucleic acids can encode full-length variant forms of proteins as 
well as the wild-type protein. The variant proteins (which could be especially 

1 0 useful for detection and treatment of disorders) will have the variant amino acid 
sequences encoded by the polymorphisms described in Table 10, when said 
polymorphisms are read so as to be in -frame with the full-length coding 
sequence of which it is a component. 

Large quantities of the nucleic acids and proteins of the present 

15 invention may be prepared by expressing the Gene 216 nucleic acids or 
portions thereof in vectors or other expression vehicles in compatible 
prokaryotic or eukaryotic host cells. The most commonly used prokaryotic 
hosts are strains of Escherichia coii, although other prokaryotes, such as 
Bacillus subtilis or Pseudomonas may also be used. Mammalian or other 

20 eukaryotic host cells, such as those of yeast, filamentous fungi, plant, insect, 
or amphibian or avian species, may also be useful for production of the 
proteins of the present invention. For example, insect cell systems (i.e., 
lepidopteran host cells and baculovirus expression vectors) are particularly 
suited for large-scale protein production. 

25 Host cells carrying an expression vector (i.e., transformants or clones) 

are selected using markers depending on the mode of the vector construction. 
The marker may be on the same or a different DNA molecule, preferably the 
same DNA molecule. In prokaryotic hosts, the transformant may be selected, 
e.g., by resistance to ampicillin, tetracycline or other antibiotics. Production of 

30 a particular product based on temperature sensitivity may also serve as an 
appropriate marker. 
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Prokaryotic or eukaryotic cells comprising the nucleic acids of the 
present invention will be useful not only for the production of the nucleic acids 
and proteins of the present invention, but also, for example, in studying the 
characteristics of Gene 216 proteins. Cells and animals that carry the Gene 
5 216 gene can be used as model systems to study and test for substances that 
have potential as therapeutic agents. The cells are typically cultured 
mesenchymal stem cells. These may be isolated from individuals with somatic 
or germline Gene 216 gene. Alternatively, the cell line can be engineered to 
carry the Gene 216 genes, as described above. After a test substance is 
10 applied to the cells, the transformed phenotype of the ceil is determined. Any 
trait of transformed cells can be assessed, including respiratory diseases 
including asthma, atopy, and response to application of putative therapeutic 
agents. 

Antisense Nucleic Acids 

15 A further embodiment of the invention is antisense nucleic acids or 

oligonucleotides that are complementary, in whole or in part, to a target 
molecule comprising a sense strand of Gene 216. The Gene 216 target can 
be DNA, or its RNA counterpart (i.e., wherein thymine (T) is present in DNA 
and uracil (U) is present in RNA). When introduced into a cell, antisense 

20 nucleic acids or oligonucleotides can hybridize to all or a part of the sense 
strand of Gene 216, thereby inhibiting gene expression or replication. 

In a particular embodiment of the invention, an antisense nucleic acid 
or oligonucleotide is wholly or partially complementary to, and can hybridize 
with, a target nucleic acid (either DNA or RNA) having the sequence of SEQ 

25 ID NO:1 or SEQ ID NO:6. For example, an antisense nucleic acid or 
oligonucleotide comprising 16 nucleotides can be sufficient to inhibit 
expression of the Gene 216 protein. Alternatively, an antisense nucleic acid 
or oligonucleotide can be complementary to 5* or 3' untranslated regions, or 
can overlap the translation initiation codon (5' untranslated and translated 

30 regions) of the Gene 216 gene, or its functional equivalent. In another 
embodiment, the antisense nucleic acid is wholly or partially complementary 
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to, and can hybridize with, a target nucleic acid that encodes a Gene 216 
polypeptide. 

In addition, oligonucleotides can be constructed which will bind to 
duplex nucleic acid (i.e., DNA:DNA or DNA:RNA), to form a stable triple helix- 
5 containing or triplex nucleic acid. Such triplex oligonucleotides can inhibit 
transcription and/or expression of a gene encoding Gene 21 6, or its functional 
equivalent (M.D. Frank-Kamenetskii and S.M. Mirkin, 1995, Ann. Rev. 
Biochem. 64:65-95). Triplex oligonucleotides are constructed using the base- 
pairing rules of triple helix formation and the nucleotide sequence of the gene 

10 or mRNA for Gene 21 6. 

The present invention encompasses methods of using oligonucleotides 
in antisense inhibition of the function of Gene 216. In the context of this 
invention, the term "oligonucleotide" refers to naturally-occurring species or 
synthetic species formed from naturally-occurring subunits or their close 

1 5 homologs. The term may also refer to moieties that function similarly to 
oligonucleotides, but have non-naturally-occurring portions. Thus, 
oligonucleotides may have altered sugar moieties or inter-sugar linkages. 
Exemplary among these are phosphorothioate and other sulfur containing 
species which are known in the art. 

20 In preferred embodiments, at least one of the phosphodiester bonds of 

the oligonucleotide has been substituted with a structure that functions to 
enhance the ability of the compositions to penetrate into the region of cells 
where the RNA whose activity is to be modulated is located. It is preferred that 
such substitutions comprise phosphorothioate bonds, methyl phosphonate 

25 bonds, or short chain alkyl or cycloalkyl structures. In accordance with other 
preferred embodiments, the phosphodiester bonfcte are substituted with 
structures which are, at once, substantially non-ionic and non-chiral, or with 
structures which are chiral and enantiomerically specific. Persons of ordinary 
skill in the art will be able to select other linkages for use in the practice of the 

30 invention. 

Oligonucleotides may also include species that include at least some 
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modified base forms. Thus, purines and pyrimidines other than those normally 
found in nature may be so employed. Similarly, modifications on the furanosyl 
portions of the nucleotide subunits may also be effected, as long as the 
essential tenets of this invention are adhered to. Examples of such 
5 modifications are 2'-0-alkyl~ and 2'-halogen-substituted nucleotides. Some 
non-limiting examples of modifications at the 2' position of sugar moieties 
which are useful in the present invention include OH, SH, SCH 3 , F, OCH 3 , 
OCN, 0(CH 2 ) n NH 2 and 0(CH 2 ) n CH 3 , where n is from 1 to about 10. Such 
oligonucleotides are functionally interchangeable with natural oligonucleotides 

10 or synthesized oligonucleotides, which have one or more differences from the 
natural structure. All such analogs are comprehended by this invention so long 
as they function effectively to hybridize with Gene 216 DNA or RNA to inhibit 
the function thereof. 

The oligonucleotides in accordance with this invention preferably 

15 comprise from about 3 to about 50 subunits. It is more preferred that such 
oligonucleotides and analogs comprise from about 8 to about 25 subunits and 
still more preferred to have from about 12 to about 20 subunits. As defined 
herein, a "subunit" is a base and sugar combination suitably bound to adjacent 
subunits through phosphodiester or other bonds. 

20 Antisense nucleic acids or oligonulcleotides can be produced by 

standard techniques (see, e.g., Shewmaker et a!., U.S. Patent No. 5,107,065. 
The oligonucleotides used in accordance with this invention may be 
conveniently and routinely made through the well-known technique of solid 
phase synthesis. Equipment for such synthesis is available from several 

25 vendors, including PE Applied Biosystems (Foster City, CA). Any other means 
for such synthesis may also be employed, however, the actual synthesis of the 
oligonucleotides is well within the abilities of the practitioner. It is also will 
known to prepare other oligonucleotide such as phosphorothioates and 
alkylated derivatives. 

30 The oligonucleotides of this invention are designed to be hybridizable 

with Gene 216 RNA (e.g., mRNA) or DNA. For example, an oligonucleotide 
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(e.g., DNA oligonucleotide) that hybridizes to Gene 216 mRNA can be used to 
target the mRNA for RnaseH digestion. Alternatively, an oligonucleotide that 
hybridizes to the translation initiation site of Gene 216 mRNA can be used to 
prevent translation of the mRNA. In another approach, oligonucleotides that 
5 bind to the double-stranded DNA of Gene 216 can be administered. Such 
oligonucleotides can form a triplex construct and inhibit the transcription of the 
DNA encoding Gene 216 polypeptides. Triple helix pairing prevents the double 
helix from opening sufficiently to allow the binding of polymerases, transcription 
factors, or regulatory molecules. Recent therapeutic advances using triplex 

10 DNA have been described (see, e.g., J.E. Gee et al., 1994, Molecular and 
Immunologic Approaches, Futura Publishing Co., Mt. Kisco, NY). 

As non-limiting examples, antisense oligonucleotides may be targeted 
to hybridize to the following regions: mRNA cap region; translation initiation 
site; translational termination site; transcription initiation site; transcription 

15 termination site; polyadenylation signal; 3' untranslated region; 5' untranslated 
region; 5' coding region; mid coding region; and 3' coding region. Preferably, 
the complementary oligonucleotide is designed to hybridize to the most unique 
5' sequence Gene 216, including any of about 15-35 nucleotides spanning the 
5' coding sequence. Appropriate oligonucleotides can be designed using 

20 OLIGO software (Molecular Biology Insights, Inc., Cascade, CO; 
http://www.oligo.net). 

In accordance with the present invention, the antisense oligonucleotide 
can be synthesized, formulated as a pharmaceutical composition, and 
administered to a subject. The synthesis and utilization of antisense and triplex 

25 oligonucleotides have been previously described (e.g., H. Simon et al., 1999, 
Antisense Nucleic Acid Drug Dev. 9:527-31 ; F.X. Barre et al., 2000, Proc. Natl. 
Acad. ScL USA 97:3084-3088; R. Elez et al., 2000, Bhchem. Biophys. Res. 
Commun. 269:352-6; E.R. Sauter et al., 2000, Clin. Cancer Res. 6:654-60). 
Alternatively, expression vectors derived from retroviruses, adenovirus, herpes 

30 or vaccinia viruses, or from various bacterial plasmids may be used for delivery 
of nucleotide sequences to the targeted organ, tissue or ceil population. 
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Methods which are well known to those skilled in the art can be used to 
construct recombinant vectors which will express nucleic acid sequence that 
is complementary to the nucleic acid sequence encoding a Gene 216 
polypeptide. These techniques are described both in Sambrook et al., 1989 
5 and in Ausubel et al., 1992. For example, Gene 216 expression can be 
inhibited by transforming a cell or tissue with an expression vector that 
expresses high levels of untranslatable sense or antisense Gene 216 
sequences. Even in the absence of integration into the DNA, such vectors may 
continue to transcribe RNA molecules until they are disabled by endogenous 

10 nucleases. Transient express/on may last for a month or more with a non- 
replicating vector, and even longer if appropriate replication elements included 
in the vector system. 

Various assays may be used to test the ability of Gene 216-specific 
antisense oligonucleotides to inhibit Gene 216 expression. For example, Gene 

15 216 mRNA levels can be assessed northern blot analysis (Sambrook et a)., 
1989; Ausubel et al., 1992; J.C. Alwine et al. 1977, Proc. Natl. Acad. ScL USA 
74:5350-5354; I.M. Bird, 1998, Methods Mo I. Biol. 105:325-36), quantitative or 
semi-quantitative RT-PCR analysis (see, e.g., W.M. Freeman et al., 1999, 
Biotechniques 26:112-122; Ren et al., 1998, Moi. Brain Res. 59:256-63; J.M. 

20 Cale et al., 1998, Methods Moi Biol. 105:351-71), or in situ hybridization 
(reviewed by A.K. Raap, 1998, Mutat. Res. 400:287-298). Alternatively, 
antisense oligonucleotides may be assessed by measuring levels of Gene 21 6 
polypeptide, e.g., by western blot analysis, indirect immunofluorescence, 
immunoprecipitation techniques (see, e.g., J.M. Walker, 1998, Protein 

25 Protocols on CD-ROM, Humana Press, Totowa, NJ). 
Polypeptides 

The invention also relates to polypeptides and peptides encoded by the 
novel nucleic acids described herein. The polypeptides and peptides of this 
invention can be isolated and/or recombinant. In a preferred embodiment, the 
30 Gene 216 polypeptide, or analog or portion thereof, has at least one function 
characteristic of a Gene 216 protein, for example, proteolysis, adhesion, 
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fusion, antigenic, and intracellular activity. Protein analogs include, for 
example, naturally-occurring or genetically engineered Gene 216 variants (e.g. 
mutants) and portions thereof. Variants may differ from wild-type Gene 216 
protein by the addition, deletion, or substitution of one or more amino acid 
5 residues. In specific embodiments, polypeptide variants are encoded by Gene 
216 nucleic acids containing one or more of the SNPs disclosed herein. 
Variants also include polypeptides in which one or more residues are modified 
(i.e., by phosphorylation, sulfation, acylation, etc.), and mutants comprising one 
or more modified residues. 

10 Variant polypeptides can have conservative changes, wherein a 

substituted amino acid has similar structural or chemical properties, e.g., 
replacement of leucine with isoleucine. More infrequently, a variant 
polypeptide can have non-conservative changes, e.g., substitution of a glycine 
with a tryptophan. Guidance in determining which amino acid residues can be 

1 5 substituted, inserted, or deleted without abolishing biological or immunological 
activity can be found using computer programs well known in the art, for 
example, DNASTAR software (DNASTAR, Inc., Madison, W() 

As non-limiting examples, conservative substitutions in the Gene 216 
amino acid sequence can be made in accordance with the following table: 



Original Residue 


Conservative Substitution(s) 


Ala 


Ser 


Arg 


Lys 


Asn 


Gin, His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Giu 


Asp 


Gly 


Pro 


His 


Asn, Gin 


lie 


Leu, Val 


Leu 


lie, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, lie 


Phe 


Met, Leu, Tyr 


Ser 


Thr 


Thr 


Ser 
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Trp 


Tyr 


Tyr 


Trp, Phe 


Val 


He, Leu 



Substantial changes in function or immunogenicity can be made by 
selecting substitutions that are less conservative than those shown in the table, 
above. For example, non-conservative substitutions can be made which more 
5 significantly affect the structure of the polypeptide in the area of the alteration, 
for example, the alpha-helical, or beta-sheet structure; the charge or 
hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
The substitutions which generalfy are expected to produce the greatest 
changes in the polypeptide's properties are those where 1) a hydrophilic 

1 0 residue, e.g., seryl or threonyl, is substituted for (or by) a hydrophobic residue, 
e.g., leucyl, isoleucyl, phenylalanyl, valyl, or alanyl; 2) a cysteine or proline is 
substituted for (or by) any other residue; 3) a residue having an electropositive 
side cha\n t e.g., lysyl, arginyl or hrst/dyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or 4) a residue having a 

15 bulky side chain, e.g., phenylalanine, is substituted for (or by) a residue that 
does not have a side chain, e.g., glycine. 

In one embodiment, polypeptides of the present invention share at least 
50% amino acid sequence identity with a Gene 21 6 polypeptide, such as SEQ 
ID MO:4, or fragments thereof. Preferably, the polypeptides share at least 65% 

20 amino acid sequence identity; more preferably, the polypeptides share at least 
75% amino acid sequence identity; even more preferably, the polypeptides 
share at least 80% amino acid sequence identity with a Gene 216 polypeptide; 
still more preferably the polypeptides share at least 90% amino acid sequence 
identity with a Gene 216 polypeptide. 

25 Percent sequence identity can be calculated using computer programs 

or direct sequence comparison. Preferred computer program methods to 
determine identity between two sequences include, but are not limited to, the 
GCG program package, FASTA, BLASTP, and TBLASTN (see, e.g., D.W. 
Mount, 2001 , Bioinformatics: Sequence and Genome Analysis, Cold Spring 
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Harbor Laboratory Press, Cold Spring Harbor, NY). The BLASTP and 
TBLASTN programs are publicly available from NCBI and other sources. The 
well-known Smith Waterman algorithm may also be used to determine identity. 
Exemplary parameters for amino acid sequence comparison include the 
5 following: 1) algorithm from Needleman and Wunsch, 1970, J Mol. Biol. 
48:443-453; 2) BLOSSUM62 comparison matrix from Hentikoff and Hentikoff, 
1992, Proc. Natl. Acad. Sci. USA 89:10915-10919; 3) gap penalty = 12; and 
4) gap length penalty =4. A program useful with these parameters is publicly 
available as the "gap" program (Genetics Computer Group, Madison, Wl). The 

10 aforementioned parameters are the default parameters for polypeptide 
comparisons (with no penalty for end gaps). 

Alternatively, polypeptide sequence identity can be calculated using the 
following equation: % identity = (the number of identical residues) / (alignment 
length in amino acid residues) * 100. For this calculation, alignment length 

15 includes internal gaps but does not include terminal gaps. 

In accordance with the present invention, polypeptide sequences may 
be identical to the sequence of SEQ ID NO:4, or may include up to a certain 
integer number of amino acid alterations. Polypeptide alterations are selected 
from the group consisting of at least one amino acid deletion, substitution, 

20 including conservative and non-conservative substitution, or insertion. 
Alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal 
positions, interspersed either individually among the amino acids in the 
reference sequence or in one or more contiguous groups within the reference 

25 sequence. In specific embodiments, polypeptide variants may be encoded by 
Gene 216 nucleic acids comprising SNPs and/or alternate splice variants. 

The invention also relates to isolated, synthesized and/or recombinant 
portions or fragments of a Gene 216 protein or polypeptide as described 
herein. Polypeptide fragments (i.e., peptides) can be made which have full or 

30 partial function on their own, or which when mixed together (though fully, 
partially, or nonfunctional alone), spontaneously assemble with one or more 
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other polypeptides to reconstitute a functional protein having at least one 
functional characteristic of a Gene 216 protein of this invention. In addition, 
Gene 216 polypeptide fragments may comprise, for example, one or more 
domains of the Gene 216 polypeptide (e.g., the pre-, pro-, catalytic, cysteine- 
5 rich, disintegrin, EGF, transmembrane, and cytoplasmic domains) disclosed 
herein. 

Polypeptides according to the invention can comprise at least 5 amino 
acid residues; preferably the polypeptides comprise at least 12 residues; more 
preferably the polypeptides comprise at least 20 residues; and yet more 

10 preferably the polypeptides comprise at least 30 residues. Nucleic acids 
comprising protein-coding sequences can be used to direct the expression of 
asthma-associated polypeptides in intact cells or in cell-free translation 
systems. The coding sequence can be tailored, if desired, for more efficient 
expression in a given host organism, and can be used to synthesize 

15 oligonucleotides encoding the desired amino acid sequences. The resulting 
oligonucleotides can be inserted into an appropriate vector and expressed in 
a compatible host organism or translation system. 

The polypeptides of the present invention, including function- 
conservative variants, may be isolated from wild-type or mutant cells (e.g., 

20 human cells or cell lines), from heterologous organisms or cells (e.g., bacteria, 
yeast, insect, plant, and mammalian cells), or from cell-free translation systems 
(e.g., wheat germ, microsomal membrane, or bacterial extracts) in which a 
protein-coding sequence has been introduced and expressed. Furthermore, 
the polypeptides may be part of recombinant fusion proteins. The polypeptides 

25 can also, advantageously, be made by synthetic chemistry. Polypeptides may 
be chemically synthesized by commercially available automated procedures, 
including, without limitation, exclusive solid phase synthesis, partial solid phase 
methods, fragment condensation or classical solution synthesis. 

Methods for polypeptide purification are well-known in the art, including, 

30 without limitation, preparative disc-gel electrophoresis, isoelectric focusing, 
HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition 
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chromatography, and cou intercurrent distribution. For some purposes, it is 
preferable to produce the polypeptide in a recombinant system in which the 
protein contains an additional sequence (e.g., epitope or protein) tag that 
facilitates purification. Non-limiting examples of epitope tags include c-myc, 
5 haemagglutinin (HA), polyhistidine (6X-H1S) (SEQ ID NO:32), GLU-GLU, and 
DYKDDDDK (SEQ ID NO:33) (FLAG®) epitope tags. Non-limiting examples 
of protein tags include glutathione-S-transferase (GST), green fluorescent 
protein (GFP), and maltose binding protein (MBP). 

In one approach, the coding sequence of a polypeptide or peptide can 

10 be cloned into a vector that creates a fusion with a sequence tag of interest. 
Suitable vectors include, without limitation, pRSET (Invitrogen Corp., San 
Diego, CA), pGEX (Amersham-Pharmacia Biotech, Inc., Piscataway, NJ), 
pEGFP (CLONTECH Laboratories, Inc., Palo Alto, CA), and pMAL™ (New 
England BioLabs (NEB), Inc., Beverly, MA) plasmids. Following expression, 

15 the epitope, or protein tagged polypeptide or peptide can be purified from a 
crude lysate of the translation system or host cell by chromatography on an 
appropriate solid-phase matrix. In some cases, it may be preferable to remove 
the epitope or protein tag (i.e., via protease cleavage) following purification. As 
an alternative approach, antibodies produced against a disorder-associated 

20 protein or against peptides derived therefrom can be used as purification 
reagents. Other purification methods are possible. 

The present invention also encompasses polypeptide derivatives of 
Gene 216. The isolated polypeptides may be modified by, for example, 
phosphorylation, sulfation, acylation, or other protein modifications. They may 

25 also be modified with a label capable of providing a detectable signal, either 
directly or indirectly, including, but not limited to, radioisotopes and fluorescent 
compounds. 

Both the naturally occurring and recombinant forms of the polypeptides 
of the invention can advantageously be used to screen compounds for binding 
30 activity. Many methods of screening for binding activity are known by those 
skilled in the art and may be used to practice the invention. Several methods 
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of automated assays have been developed in recent years so as to permit 
screening of tens of thousands of compounds in a short period of time. Such 
high-throughput screening methods are particularly preferred. The use of high- 
throughput screening assays to test for inhibitors is greatly facilitated by the 
5 availability of large amounts of purified polypeptides, as provided by the 
invention. The polypeptides of the invention also find use as therapeutic 
agents as well as antigenic components to prepare antibodies. 

The polypeptides of this invention find use as immunogenic components 
useful as antigens for preparing antibodies by standard methods. It is well 

10 known in the art that immunogenic epitopes generally contain at least about 
five amino acid residues (Ohno et a!., 1985, Proc. Natl. Acad. Sci. USA 
82:2945). Therefore, the immunogenic components of this invention will 
typically comprise at least 5 amino acid residues of the sequence of the 
complete polypeptide chains. Preferably, they will contain at least 7, and most 

15 preferably at least about 10 amino acid residues or more to ensure that they 
will be immunogenic. Whether a given component Is immunogenic can readily 
be determined by routine experimentation Such immunogenic components 
can be produced by proteolytic cleavage of larger polypeptides or by chemical 
synthesis or recombinant technology and are thus not limited by proteolytic 

20 cleavage sites. The present invention thus encompasses antibodies that 
specifically recognize asthma-associated immunogenic components. 
Structural Studies 

A purified Gene 216 polypeptide can be analyzed by well-established 
methods (e.g., X-ray crystallography, NMR, CD, etc.) to determine the three- 

25 dimensional structure of the molecule. The three-dimensional structure, in 
turn, can be used to model intermolecular interactions. Exemplary methods for 
crystallization and X-ray crystallography are found in P.G. Jones, 1981, 
Chemistry in Britain, 17:222-225; C. Jones et al. (eds), Crystallographic 
Methods and Protocols, Humana Press, Totowa, NJ; A. McPherson, 1982, 

30 Preparation and Analysis of Protein Crystals, John Wiley & Sons, New York, 
NY; T.L. Blundell and L.N. Johnson, 1976, Protein Crystallography, Academic 
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Press, inc., New York, NY; A. Holden and P. Singer, 1960, Crystals and Crystal 
Growing, Anchor Books-Doubleday, New York, NY; R.A. Laudise, 1970, The 
Growth of Single Crystals, Solid State Physical Electronics Series, N. 
Holonyak, Jr., (ed), Prentice-Hall, Inc.; G.H. Stout and L.H. Jensen, 1989, X- 
5 ray Structure Determination: A Practical Guide, 2nd edition, John Wiliey & 
Sons, New York, NY; Fundamentals of Analytical Chemistry, 3rd. edition, 
Saunders Golden Sunburst Series, Holt, Rinehartand Winston, Philadelphia, 
PA, 1976; P.D. Boyle of the Department of Chemistry of North Carolina State 
University at http://laue.chem.ncsu.edu/web/GrowXtal.html; M.B. Berry, 1995, 

10 Protein Crystalization: Theory and Practice, Structure and Dynamics ofE. coli 
Adenylate Kinase, Doctoral Thesis, Rice University, Houston TX; 
www.bioc.rice.edu/-berry/papers/crystalization/ crystalization.html. 

For X-ray diffraction studies, single crystals can be grown to suitable 
size. Preferably, a crystal has a size of 0.2 to 0.4 mm in at least two of the 

1 5 three dimensions. Crystals can be formed in a solution comprising a Gene 216 
polypeptide (e.g., 1 .5-200 mg/ml) and reagents that reduce the solubility to 
conditions close to spontaneous precipitation. Factors that affect the formation 
of polypeptide crystals include: 1 ) purity; 2) substrates or co-factors; 3) pH; 4) 
temperature; 5) polypeptide concentration; and 6) characteristics of the 

20 precipitant. Preferably, the Gene 216 polypeptides are pure, i.e., free from 
contaminating components (at least 95% pure), and free from denatured Gene 
216 polypeptides. In particular, polypeptides can be purified by FPLC and 
HPLC techniques to assure homogeneity (see, Lin et al., 1992, J. Crystal. 
Growth. 122:242-245). Optionally, Gene 216 polypeptide substrates or co- 

25 factors can be added to stabilize the quaternary structure of the protein and 
promote lattice packing. 

Suitable precipitants for crystallization include, but are not limited to, 
salts (e.g., ammonium sulphate, potassium phosphate); polymers (e.g., 
polyethylene glycol (PEG) 6000); alcohols (e.g., ethanol); polyalcohols (e.g., 

30 1-methyl-2,4 pentane dioi (MPD)); organic solvents; sulfonic dyes; and 
deionized water. The ability of a salt to precipitate polypeptides can be 
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generally described by the Hofmeister series: P0 4 3 " > HPO4 2 " = S0 4 2 ~ > citrate 
> CH 3 C0 2 "> CI' > Br"> N0 3 "> C10 4 " > SCN"; and NH 4 + > K + > Na + > Li + . Non- 
limiting examples of salt precipitants are shown below (see Berry, 1995). 



Precipitant 


Maximum concentration 


(NH 4 7Na7Lf) 2 or Mg 2 +S0 4 z * 


4.0/1.5/2.1 /2.5M 


NH 4 7lMa + /K + P0 4 a " 


3.0 / 4.0/4.0 M 


NH 4 7K7Na + /Lf citrate 


-1.8 M 


NH 4 7K7Na7Li + acetate 


-3.0 M 


NH 4 7K7Na7Lf Cl" 


5.2 / 9.8 / 4.2 / 5.4 M 


NH 4 + N0 3 " 


-8.0 M 



5 High molecular weight polymers useful as precipitating agents include 

polyethylene glycol (PEG), dextran, polyvinyl alcohol, and polyvinyl pyrrolidone 
(A. Poison et al., 1964, Biochem. Biophys. Acta. 82:463-475). In general, 
polyethylene glycol (PEG) is the most effective for forming crystals. PEG 
compounds with molecular weights less than 1000 can be used at 

1 0 concentrations above 40% v/v. PEGs with molecular weights above 1000 can 
be used at concentration 5-50% w/v. Typically, PEG solutions are mixed with 
~0.l % sodium azide to prevent bacterial growth. 

Typically, crystallization requires the addition of buffers and a specific 
salt content to maintain the proper pH and ionic strength for a protein's stability. 

15 Suitable additives include, but are not limited to sodium chloride (e.g., 50-500 
mM as additive to PEG and MPD; 0.15-2 M as additive to PEG); potassium 
chloride (e.g., 0.05-2 M); lithium chloride (e.g., 0.05-2 M); sodium fluoride (e.g., 
20-300 mM); ammonium sulfate (e.g., 20-300 mM); lithium sulfate (e.g., 0.05-2 
M); sodium or ammonium thiocyanate (e.g., 50-500 mM); MPD (e.g., 0.5-50%); 

20 1,6 hexane diol (e.g., 0.5-10%); 1,2,3 heptane triol (e.g., 0.5-15%); and 
benzamidine (e.g., 0.5-15%). 

Detergents may be used to maintain protein solubility and prevent 
aggregation. Suitable detergents include, but are not limited to non-ionic 
detergents such as sugar derivatives, oligoethyleneglycol derivatives, 

25 dimethylamine-N-oxides, cholate derivatives, N-octyl hydroxyalkylsulphoxides, 
sulphobetains, and lipid-like detergents.. Sugar-derived detergents include alkyl 
glucopyranosides (e.g., C8-GP, C9-GP), alkyl thio-glucopyranosides (e.g., C8- 
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tGP), alkyl maltopyranosides (e.g., C10-M, C12-M; CYMAL-3, CYMAL-5, 
CYMAL-6), alkyl thio-maltopyranosides, alkyl galactopyranosides, alkyl 
sucroses (e.g., N-octanoylsucrose), and glucamides (e.g., HECAMEG, C- 
HEGA-10; MEGA-8). Oligoethyleneglycol-derived detergents include alkyl 
5 polyoxyethylenes (e.g., C8-E5, C8-En; C12-E8; C12-E9) and phenyl 
polyoxyethylenes (e.g., Triton X-100). Dimethylamine-N-oxide detergents 
include, e.g., C10-DAO; DDAO; LDAO. Cholate-de rived detergents include, 
e.g., Deoxy-Big CHAP, digitonin. Lipid-like detergents include phosphocholine 
compounds. Suitable detergents further include zwitter-ionic detergents (e.g., 

10 ZWITTERGENT 3-10; ZWITTERGENT 3-12); and ionic detergents (e.g., SDS). 

Crystallization of macro molecules has been performed at temperatures 
ranging from 60°C to less than 0°C. However, most molecules can be 
crystallized at 4°C or 22°C. Lower temperatures promote stabilization of 
polypeptides and inhibit bacterial growth. In general, polypeptides are more 

15 soluble in salt solutions at lower temperatures (e.g., 4°C), but less soluble in 
PEG and MPD solutions at lower temperatures. To allow crystallization at 4°C 
or 22°C, the precipitant or protein concentration can be increased or decreased 
as required. Heating, melting, and cooling of crystals or aggregates can be 
used to enlarge crystals. In addition, crystallization at both 4°C and 22°C can 

20 be assessed (A. McPherson, 1992, J. Cryst Growth. 122:161-167; C.W. 
Carter, Jr. and C.W. Carter, 1979, J. Biol. Chem. 254:12219-12223; T. 
Bergfors, 1993, Crystalization Lab Manual). 

A crystallization protocol can be adapted to a particular polypeptide or 
peptide. In particular, the physical and chemical properties of the polypeptide 

25 can be considered (e.g., aggregation, stability, adherence to membranes or 
tubing, internal disulfide linkages, surface cysteines, chelating ions, etc.). For 
initial experiments, the standard set of crystalization reagents can be used 
(Hampton Research, Laguna Niguel, CA). In addition, the CRYSTOOL 
program can provide guidance in determining optimal crystallization conditions 

30 (Brent Segelke, 1995, Efficiency analysis of sampling protocols used in protein 
crystallization screening and crystal structure from two novel crystal forms of 
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PLA2, Ph.D. Thesis, University of California, San Diego; http://www. 
c^p14.ac.uk/ccp/web-mirrors/lin!rupp/crystool/crystool.htm). Exemplary 
crystallization conditions are shown below (see Berry, 1995). 



Major Precipitant 


Additive 


Concentration of 
Major Precipitant 


Concentration 
of Additive 


(NH 4 ) 2 S0 4 


PEG 400-2000, MPD, 
ethanol, or methanol 


2.0-4.0 M 


6%-0.5% 


Na citrate 


PEG 400-2000, MPD, 
ethanol, or methanol 


1.4-1.8 M 


6%-0.5% 


PEG 1000-20000 


(NH4) Z S0 4 , NaCI, or Na 
formate 


40-50% 


0.2-0.6 M 



5 

Robots can be used for automatic screening and optimization of 
crystallization conditions. For example, the IMPAX and Oryx systems can be 
used (Douglas Instruments, Ltd., East Garston, United Kingdom). The 
CRYSTOOL program (SegeJke, supra) can be integrated with the robotics 

10 programming. In addition, the Xact program can be used to construct, 
maintain, and record the results of various crystallization experiments (see, 
e.g., D.E. Brodersen et al., 1999, J. Appf. Cryst 32: 1012-1016; G.R. Andersen 
and J. Nyborg, 1996, J. Appl. Cryst 29:236-240). The Xact program supports 
multiple users and organizes the results of crystallization experiments into 

15 hierarchies. Advantageously, Xact is compatible with both CRYSTOOL and 
Microsoft® Excel programs. 

Four methods are commonly employed to crystallize macromolecules: 
vapor diffusion, free interface diffusion, batch, and dialysis. The vapor 
diffusion technique is typically performed by formulating a 1:1 mixture of a 

20 solution comprising the polypeptide of interest and a solution containing the 
precipitant at the final concentration that is to be achieved after vapor 
equilibration. The drop containing the 1:1 mixture of protein and precipitant is 
then suspended and sealed over the well solution, which contains the 
precipitant at the target concentration, as either a hanging or sitting drop. 

25 Vapor diffusion can be used to screen a large number of crystallization 
conditions or when small amounts of polypeptide are available. For screening, 
drop sizes of 1 to 2 jxl can be used. Once preliminary crystallization conditions 
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have been determined, drop sizes such as 10 ^il can be used. Notably, results 
from hanging drops may be improved with agarose gels (see K. Provost and 
M.-C. Robert, 1991, J. Cryst Growth. 110:258-264). Free interface diffusion 
is performed by layering of a low density solution onto one of higher density, 
5 usually in the form of concentrated protein onto concentrated salt Since the 
solute to be crystallized must be concentrated, this method typically requires 
relatively large amounts of protein. However, the method can be adapted to 
work with small amounts of protein. In a representative experiment, 2 to 5 p.l 
of sample is pipetted into one end of a 20 p.\ microcapillary pipet. Next, 2 to 5 

10 iil of precipitant is pipetted into the capillary without introducing an air bubble, 
and the ends of the pipet are sealed. With sufficient amounts of protein, this 
method can be used to obtain relatively large crystals (see, e.g., S.M. Althoff 
et al., 1988, J. Mol. BioL 199:665-666). 

The batch technique is performed by mixing concentrated polypeptide 

15 with concentrated precipitant to produce a final concentration that is 
supersaturated for the solute macromolecule. Notably, this method can 
employ relatively large amounts of solution (e.g., milliliter quantities), and can 
produce large crystals. For that reason, the batch technique is not 
recommended for screening initial crystallization conditions. 

20 The dialysis technique is performed by diffusing precipitant molecules 

through a semipermeable membrane to slowly increase the concentration of 
the solute inside the membrane. Dialysis tubing can be used to dialyze milliliter 
quantities of sample, whereas dialysis buttons can be used to dialyze microliter 
quantities (e.g., 7-200 (al). Dialysis buttons may be constructed out of glass, 

25 perspex, or Teflon™ (see, e.g., Cambridge Repetition Engineers Ltd., Greens 
Road, Cambridge CB4 3EQ, UK; Hampton Research). Using this method, the 
precipitating solution can be varied by moving the entire dialysis button or sack 
into a different solution, in this way, polypeptides can be "reused" until the 
correct conditions for crystallization are found (see, e.g., C.W. Carter, Jr. et al., 

30 1988, J. Cryst Growth. 90:60-73). However, this method is not recommended 
for precipitants comprising concentrated PEG solutions. 
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Various strategies have been designed to screen crystallization 
conditions, including 1) pi screening; 2) grid screening; 3) factorials; 4) 
solubility assays; 5) perturbation; and 6) sparse matrices. In accordance with 
the pi screening method, the pi of a polypeptide is presumed to be its 
5 crystallization point. Screening at the pi can be performed by dialysis against 
low concentrations of buffer (less than 20 mM) at the appropriate pH, or by use 
of conventional precipitants. 

The grid screening method can be performed on two-dimensional 
matrices. Typically, the precipitant concentration is plotted against pH. The 

10 optimal conditions can be determined for each axis, and then combined. At 
that point, additional factors can be tested (e.g., temperature, additives). This 
method works best with fast-forming crystals, and can be readily automated 
(see M.J. Cox and P.C. Weber, 1988, J. Cryst. Growth. 90:318-324). Grid 
screens are commercially available for popular precipitants such as ammonium 

15 sulphate, PEG 6000, MPD, PEG/LiCI, and NaCl (see, e.g., Hamilton 
Research). 

The incomplete factorial method can be performed by 1) selecting a set 
of -20 conditions; 2) randomly assigning combinations of these conditions; 3) 
grading the success of the results of each experiment using an objective scale; 

20 and 4) statistically evaluating the effects of each of the conditions on crystal 
formation (see, e.g., C.W. Carter, Jr. et aL, 1988, J. Cryst Growth. 90:60-73). 
In particular, conditions such as pH, temperature, precipitating agent, and 
cations can be tested. Dialysis buttons are preferably used with this method. 
Typically, optimal conditions/combinations can be determined within 35 tests. 

25 Similar approaches, such as "footprinting" conditions, may also be employed 
(see, e.g., E.A. Stura et ah, 1991, J. Cryst Growth, 110:1-2). 

The perturbation approach can be performed by altering crystallization 
conditions by introducing a series of additives designed to test the effects of 
altering the structure of bulk solvent and the solvent dielectric on crystal 

30 formation (see, e.g., Whitaker et a!., 1995, Biochem. 34:8221-8226). Additives 
for increasing the solvent dialectric include, but are not limited to, NaCl, KCI, 

-52- 



WO 01/78894 



PCT/US01/12245 



or LiCI (e.g., 200 mM); Na formate (e.g., 200 mM); Na 2 HP0 4 or K 2 HP0 4 (e.g., 
200 mM); urea, triachloroacetate, guanidium HCI, or KSCN (e.g., 20-50 mM). 
A non-limiting list of additives for decreasing the solvent dialectric include 
methanol, ethanol, isopropanoJ, or tert-butano) (e.g., 1-5%); MPD (e.g., 1%); 
5 PEG 400, PEG 600, or PEG 1000 (e.g., 1-4%); PEG MME (monomethylether) 
550, PEG MME 750, PEG MME 2000 (e.g., 1-4%). 

As an alternative to the above-screening methods, the sparse matrix 
approach can be used (see, e.g., J. Jancarik and S.-H.J. Kim, 1991, >App/. 
Cryst 24:409-411; A. McPherson, 1992, J. Cryst Growth. 122:161-167; B. 

10 Cudney et ah, 1994, Acta. Cryst D50:41 4-423). Sparse matrix screens are 
commercially available (see, e.g., Hampton Research; Molecular Dimensions, 
Inc., Apopka, FL; Emerald Biostructures, Inc., Lemont, JL). Notably, data from 
Hampton Research sparse matrix screens can be stored and analyzed using 
ASPRUN software (Douglas Instruments). 

1 5 Exemplary conditions for an initial screen are shown below (see Berry, 

1995). 

TABLE 1 

Tray 1 : 



PEG 8000 (wells 1-6) 


Ammonium sulfate (wells 7-12) 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


20% 


20% 


20% 


35% 


35% 


35% 


2.0 M 


2.0 M 


2.0 M 


2.5 M 


2.5 M 


2.5 M 


pH 5.0 


pH 7.0 


pH 8.6 


L pH 5.0 


pH 7.0 


pH8.6 


PH5.0 


pH7.0 


pH 8.8 


pH5.0 


PH7.0 


pH 8.8 


MPD (wefts 13-16) 


Na Citrate (wells 17-20) 


Na/K Phosphate (wells 21-24) 


13 


14 


15 


16 


17 


18 


19 


20 


21 


22 


23 


24 


30% 


30% 


50% 


50% 


1.3 M 


1.3 M 


1.5 M 


1.5 M 


2.0 M 


2.0 M 


2.5 M 


2.5 M 


pH 5.8 


pH 7.6 


pH 5.8 


pH 7.6 


pH 5.8 


PH7.5 


pH 5.8 


pH 7.5 


pH 6.0 


PH7.4 


PH6.0 


pH 7.4 



20 Tray 2: 



PEG 2000 MME/0.2 M Ammon. sulfate (wells 25-30) 



25 


26 


27 


28 


29 


30 


25% 
pH 5.5 


25% 
pH 7.0 


25% 
pH8.5 


40% 
pH 5.5 


40% 
PH7.0 


40% 
pH 8.5 



Random for wells 31 to 48 



The initial screen can be used with hanging or sitting drops. To 
conserve the sample, tray 2 can be set up several weeks following tray 1. 
Wells 31-48 of tray 2 can comprise a random set of solutions. Alternatively, 
25 solutions can be formulated using sparse methods. Preferably, test solutions 
cover a broad range of precipitants, additives, and pH (especially pH 5.0-9.0). 
Seeding can be used to trigger nucleation and crystal growth (Stura and 
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Wilson, 1990, J. Cryst Growth. 110:270-282; C. Thaller et al., 1981, J. Mol. 
Biol. 147:465-469; A. McPherson and P. Schlichta, 1988, J. Cryst Growth. 
90:47-50). In general, seeding can performed by transferring crystal seeds into 
a polypeptide solution to allow polypeptide molecules to deposit on the surface 
5 of the seeds and produce crystals. Two seeding methods can be used: 
microseeding and macroseeding. For microseeding, a crystal can be ground 
into tiny pieces and transferred into the protein solution. Alternatively, seeds 
can be transferred by adding 1-2 \i\ of the seed solution directly to the 
equilibrated protein solution. In another approach, seeds can be transferred 

10 by dipping a hair in the seed solution and then streaking the hair across the 
surface of the drop (streak seeding; see Stura and Wilson, supra). For 
macroseeding, an intact crystal can be transferred into the protein solution 
(see, e.g., C. Thaller et al., 1981, J. Mol. Biol. 147:465-469). Preferably, the 
surface of the crystal seed is washed to regenerate the growing surface prior 

15 to being transferred. Optimally, the protein solution for crystallization is close 
to saturation and the crystal seed is not completely dissolved upon transfer. 
Antibodies 

An isolated Gene 216 polypeptide or a portion or fragment thereof, can 
be used as an immunogen to generate anti-Gene 216 antibodies using 

20 standard techniques for polyclonal and monoclonal antibody preparation. The 
full-length Gene 21,6 polypeptide can be used or, alternatively, the invention 
provides antigenic peptide fragments of Gene 216 for use as immunogens. 
The antigenic peptide of Gene 216 comprises at least 5 amino acid residues 
of the amino acid sequence shown in SEQ ID NO:4, and encompasses an 

25 epitope of Gene 216 such that an antibody raised against the peptide forms a 
specific immune complex with Gene 216 amino acid sequence. 

Accordingly, another aspect of the invention pertains to anti-Gene 216 
antibodies. The invention provides polyclonal and monoclonal antibodies that 
bind Gene 216 polypeptides or peptides. The term "monoclonal antibody" or 

30 "monoclonal antibody composition", as used herein, refers to a population of 
antibody molecules that contain only one species of an antigen binding site 
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capable of immunoreacting with a particular epitope of a Gene 216 polypeptide 
or peptide. A monoclonal antibody composition thus typically displays a single 
binding affinity for a particular Gene 216 polypeptide or peptide with which it 
immunoreacts. 

5 A Gene 216 immunogen typically is used to prepare antibodies by 

immunizing a suitable subject, (e.g., rabbit, goat, mouse, or other non-human 
mammal) with the immunogen. An appropriate immunogenic preparation can 
contain, for example, recombinantly expressed Gene 216 polypeptide or a 
chemically synthesized Gene 216 polypeptide, or fragments thereof. The 

10 preparation can further include an adjuvant, such as Freund's complete or 
incomplete adjuvant, or similar immunostimulatory agent. Immunization of a 
suitable subject with an immunogenic Gene 216 preparation induces a 
polyclonal anti-Gene 216 antibody response. 

A number of adjuvants are known and used by those skilled in the art. 

15 Non-limiting examples of suitable adjuvants include incomplete Freund's 
adjuvant, mineral gels such as alum, aluminum phosphate, aluminum 
hydroxide, aluminum silica, and surface-active substances such as lysoJec/tnin, 
pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, and dinitrophenol. Further examples of adjuvants include N- 

20 acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L- 
alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP), N-acetylmuramyl- 
Lalanyl-D-isoglutaminyl-L-alanine^^l^-dipalmitoyl-sn-glycero-S 
hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE), and 
RIBI, which contains three components extracted from bacteria, 

25 monophosphoryl lipid A, trehalose dimycolate and cell wall skeleton 
(MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion. A particularly useful 
adjuvant comprises 5% (wt/vol) squalene, 2.5% Pluronic L121 polymer and 
0.2% polysorbate in phosphate buffered saline (Kwak et al., 1992, New Eng. 
J. Med. 327:1209-1215). Preferred adjuvants include complete BCG, Detox, 

30 (RIBI, Immunochem Research Inc.), ISCOMS, and aluminum hydroxide 
adjuvant (Superphos, Biosector). The effectiveness of an adjuvant may be 
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determined by measuring the amount of antibodies directed against the 
immunogenic peptide. 

Polyclonal anti-Gene 216 antibodies can be prepared as described 
above by immunizing a suitable subject with a Gene 216 immunogen. The 
5 anti-Gene 216 antibody titer in the immunized subject can be monitored over 
time by standard techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized Gene 21 6. If desired, the antibody molecules 
directed against Gene 216 can be isolated from the mammal (e.g., from the 
blood) and further purified by well-known techniques, such as protein A 

10 chromatography to obtain the IgG fraction. 

At an appropriate time after immunization, e.g., when the anti-Gene 216 
antibody titers are highest, antibody-producing cells can be obtained from the 
subject and used to prepare monoclonal antibodies by standard techniques, 
such as the hybridoma technique (see Kohler and Milstein, 1975, Nature 

15 256:495-497; Brown et al., 1981, J. Immunol. 127:539-46; Brown et al., 1980, 
J. Biol. Chem. 255:4980-83; Yeh et al., 1976, PNAS 76:2927-31; and Yeh et 
al., 1982, Int. J. Cancer 29:269-75), the human B cell hybridoma technique 
(Kozbor et al., 1983, Immunol. Today 4:72), the EBV-hybridoma technique 
(Cole et al., 1985, Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 

20 Inc., pp. 77-96) or trioma techniques. 

The technology for producing hybridomas is well-known (see generally 
R. H. Kenneth, 1980, Monoclonal Antibodies: A New Dimension In Biological 
Analyses, Plenum Publishing Corp., New York, NY; E.A. Lemer, 1981, Yale J. 
Biol. Med., 54:387-402; M.L. Gefter et ai., 1977, Somatic Cell Genet 3:231- 

25 36). In general, an immortal cell line (typically a myeloma) is fused to 
lymphocytes (typically splenocytes) from a mammal immunized with a Gene 
216 immunogen as described above, and the culture supematants of the 
resulting hybridoma cells are screened to identify a hybridoma producing a 
monoclonal antibody that binds Gene 216 polypeptides or peptides. 

30 Any of the many well known protocols used for fusing lymphocytes and 

immortalized cell lines can be applied for the purpose of generating an anti- 
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Gene 216 monoclonal antibody (see, e.g., G. Galfre et al. f 1977, Nature 
266:55052; Gefter et al., 1977; Lemer, 1981; Kenneth, 1980). Moreover, the 
ordinarily skilled worker will appreciate that there are many variations of such 
methods. Typically, the immortal cell line (e.g., a myeloma cell line) is derived 
5 from the same mammalian species as the lymphocytes. For example, murine 
hybridomas can be made by fusing lymphocytes from a mouse immunized with 
an immunogenic preparation of the present invention with an immortalized 
mouse cell line. Preferred immortal cell lines are mouse myeloma cell lines 
that are sensitive to culture medium containing hypoxanthine, aminopterin, and 

1 0 thymidine (HAT medium). Any of a number of myeloma cell lines can be used 
as a fusion partner according to standard techniques, e.g., the P3-NS1/1~Ag4- 
1, P3-x63-Ag8.653, or Sp2/0-Ag14 myeloma lines. These myeloma lines are 
available from ATCC (American Type Culture Collection, Manassas, VA). 
Typically, HAT-sensitive mouse myeloma cells are fused to mouse splenocytes 

1 5 using polyethylene glycol (PEG). Hybridoma cells resulting from the fusion arc 
then selected using HAT medium, which kills unfused and unproductively fused 
myeloma cells (unfused splenocytes die after several days because they are 
not transformed). Hybridoma cells producing a monoclonal antibody of the 
invention are detected by screening the hybridoma culture supernatants for 

20 antibodies that bind Gene 21 6 polypeptides or peptides, e.g., using a standard 
ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal anti-Gene 216 antibody can be identified and isolated by screening 
a recombinant combinatorial immunoglobulin library (e.g., an antibody phage 

25 display library) with Gene 216 to thereby isolate immunoglobulin library 
members that bind Gene 216. Kits for generating and screening phage display 
libraries are commercially available (e.g., the Pharmacia Recombinant Phage 
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ 
Phage Display Kit, Catalog No. 240612). 

30 Additionally, examples of methods and reagents particularly amenable 

for use in generating and screening antibody display library can be found in, 
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for example, Ladner et al. U.S. Pat. No. 5,223,409; Kang et al. PCT 
International Publication No. WO 92/18619; Dower et al. PCT International 
Publication No. WO 91/17271 ; Winter et al. PCT International Publication WO 
92/20791; MarkJand et al PCT International Publication No. WO 92/15679; 
5 Breitling et al. PCT International Publication WO 93/01288; McCafferty et al. 
PCT International Publication No. WO 92/01047; Garrard et al. PCT 
International Publication No. WO 92/09690; Ladner et al. PCT International 
Publication No. WO 90/02809; Fuchs et al., 1991, Bio/Technology 9:1370- 
1372; Hay et al., 1992, Hum. Antibod. Hybridomas 3:81-85; Huse et al., 1989, 

10 Science 246:1275-1281 ; Griffiths et al., 1993, EMBO J 12:725-734; Hawkins 
et al., 1992, J. Mol. Biol 226:889-896; Clarkson et al., 1991, Nature 352:624- 
628; Gram et al., 1992, PNAS 89:3576-3580; Garrad et al., 1991, 
Bio/Technology 9:1373-1377; Hoogenboom et al., 1991, Nuc. Acid Res. 
19:4133-4137; Barbas et al., 1991, PNAS 88:7978-7982; and McCafferty et al., 

1 5 1 990, Nature 348:552-55. 

Additionally, recombinant anti-Gene 216 antibodies, such as chimeric 
and humanized monoclonal antibodies, comprising both human and non- 
human portions, which can be made using standard recombinant DNA 
techniques, are within the scope of the invention. Such chimeric and 

20 humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in Robinson 
et al. International Application No. PCT/US86/02269; Akira, et al. European 
Patent Application 184,187; Taniguchi, M., European Patent Application 
171,496; Morrison etal. European Patent Application 173,494; Neubergeret 

25 al. PCT International Publication No. WO 86/01 533; Cabilly et a!. U.S. Pat. No. 
4,816,567; Cabilly et al. European Patent Application 125,023; Better et al., 
1988, Science 240:1041-1043; Liu etal., 1987, PNAS 84:3439-3443; Liu etal., 
1987, J. Immunol. 139:3521-3526; Sun et al., 1987, PNAS 84:214-218; 
Nishimura et al., 1987, Cane. Res. 47:999-1 005;" Wood et al., 1985, Nature 

30 314:446-449; and Shaw et al., 1988, J. Natl. Cancer Inst. 80:1 553-1559; S.L. 
Morrison, 1985, Science 229:1202-1207; Oi et al., 1986, BioTechniques 4:214; 

-58- 



WO 01/78894 



PCT/US01/12245 



Winter U.S. Pat. No. 5,225,539; Jones et al., 1986, Nature 321:552-525; 
Verhoeyan et al., 1988, Science 239:1534; and Bcidler et al., 1988, J. 
Immunol. 1 41 :4053-4060 . 

An anti-Gene 216 antibody (e.g., monoclonal antibody) can be used to 
5 isolate Gene 21 6 by standard techniques, such as affinity chromatography or 
immunoprecipitation. An anti-Gene 216 antibody can also facilitate the 
purification of natural Gene 216 polypeptide from cells and of recombinantly 
produced Gene 216 polypeptides or peptides expressed in host cells. Further, 
an anti-Gene 216 antibody can be used to detect Gene 216 protein (e.g., in a 

10 cellular lysate or cell supernatant) in order to evaluate the abundance and 
pattern of expression of the Gene 216 protein. Anti-Gene 216 antibodies can 
be used diagnostically to monitor protein levels in tissue as part of a clinical 
testing procedure, e.g., to, for example, determine the efficacy of a given 
treatment regimen as described in detail herein. In addition, and anti-Gene 

15 216 antibody can be used as therapeutics for the treatment of diseases related 
to abnormal Gene 216 expression or function, e.g., asthma. 
Liqands 

The Gene 216 polypeptides, polynucleotides, variants, or fragments 
thereof, can be used to screen for ligands (e.g., agonists, antagonists, or 

20 inhibitors) that modulate the levels or activity of the Gene 21 6 polypeptide. In 
addition, these Gene 216 molecules can be used to identify endogenous 
ligands that bind to Gene 216 polypeptides or polynucleotides in the cell. In 
one aspect of the present invention, the full-length Gene 216 polypeptide (e.g., 
SEQ l[>NO:4) is used to identify ligands. Alternatively, variants or fragments 

25 of a Gene 216 polypeptide are used. Such fragments may comprise, for 
example, one or more domains of the Gene 216 polypeptide (e.g., the pre-, 
pro-, catalytic, cysteine-rich, disintegrin, EGF, transmembrane, and cytoplasmic 
domains) disclosed herein. Of particular interest are screening assays that 
identify agents that have relatively low levels of toxicity in human cells. A wide 

30 variety of assays may be used for this purpose, including in vitro protein-protein 
binding assays, electrophoretic mobility shift assays, immunoassays, and the 
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like. 

The term "figand" as used herein describes any molecule, protein, 
peptide, or compound with the capability of directly or indirectly altering the 
physiological function, stability, or levels of the Gene 216 polypeptide. Ligands 
5 that bind to the Gene 216 polypeptides or polynucleotides of the invention are 
potentially useful in diagnostic applications and/or pharmaceutical 
compositions, as described in detail herein. Ligands may encompass 
numerous chemical classes, though typically they are organic molecules, 
preferably small organic compounds having a molecular weight of more than 

10 50 and less than about 2,500 daltons. Such ligands can comprise functional 
groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxy! or carboxyl 
group, preferably at least two of the functional chemical groups. Ligands often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or 

15 polyaromatic structures substituted with one or more of the above functional 
groups. Ligands can also comprise biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural 
analogs, or combinations thereof. 

Ligands may include, for example, 1) peptides such as soluble peptides, 

20 including Ig-tailed fusion peptides and members of random peptide libraries 
(see, e.g., Lam et a)., 1991, Nature 354:82-84; Houghten et al., 1991, Nature 
354:84-86) and combinatorial chemistry-derived molecular libraries made of D- 
and/or L-configu ration amino acids; 2) phosphopeptides (e.g., members of 
random and partially degenerate, directed phosphopeptide libraries, see, e.g., 

25 Songyang et al, 1993, Cell 72:767-778); 3) antibodies (e.g., polyclonal, 
monoclonal, humanized, anti-idiotypic, chimeric, and single chain antibodies 
as well as Fab, F(ab') 2 , Fab expression library fragments, and epitope-binding 
fragments of antibodies); and 4) small organic and inorganic molecules. 

Ligands can be obtained from a wide variety of sources including 

30 libraries of synthetic or natural compounds. Synthetic compound libraries are 
commercially available from, for example, Maybndge Chemical Co. (Trevillet, 
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Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates (Merrimack, 
NH), and Microsource (New Milford, CT). A rare chemical library is available 
from Aldrich Chemical Company, Inc. (Milwaukee, Wl). Natural compound 
libraries comprising bacterial, fungal, plant or animal extracts are available 
5 from, for example, Pan Laboratories (Bothell, WA). In addition, numerous 
means are available for random and directed synthesis of a wide variety of 
organic compounds and biomolecules, including expression of randomized 
oligonucleotides. 

Alternatively, libraries of natural compounds in the form of bacterial, 

10 fungal, plant and animal extracts can be readily produced. Methods for the 
synthesis of molecular libraries are readily available (see, e.g., DeWitt et al., 
1993, Prvc. Natl. Acad. Sci. USA 90:6909; Erb et al., 1994, Proc. Natl. Acad. 
Sci USA 91:1 1422; Zuckermann et al., 1994, J. Med. Chem. 37:2678; Cho et 
aL, 1993, Science 261:1303; Carell et al., 1994, Angew. Chem. Int. Ed. Engl. 

15 33:2059; Carell et al., 1994, Angew. Chem. Int. Ed. Engl. 33:2061; and In 
Gallop et aL, 1994, J. Med. Chem. 37:1233). In addition, natural or synthetic 
compound libraries and compounds can be readily modified through 
conventional chemical, physical and biochemical means (see, e.g., Blondelle 
et aL, 1996, Trends in Biotech. 14:60), and may be used to produce 

20 combinatorial libraries. In another approach, previously identified 
pharmacological agents can be subjected to directed or random chemical 
modifications, such as acylation, alkylation, esterification, amidification, and the 
analogs can be screened for Gene 216-modulating activity. 

Numerous methods for producing combinatorial libraries are known in 

25 the art, including those involving biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the 'one-bead one-compound 5 library method; and 
synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to polypeptide libraries, while the other four 

30 approaches are applicable to polypeptide, non-peptide oligomer, or small 
molecule libraries of compounds (K. S. Lam, 1997, Anticancer Drug Des. 
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12:145). 

Libraries may be screened in solution (e.g., Houghten, 1992, 
Biotechniques 13:412-421), or on beads (Lam, 1991 , Nature 354:82-84), chips 
(Fodor, 1993, Nature 364:555-556), bacteria or spores (LadnerU.S. Pat. No. 
5 5,223,409), plasmids (Cull et a!., 1992, Proc. Nati Acad. ScL USA 89:1865- 
1869), or on phage (Scott and Smith, 1990, Science 249:386-390; Devlin, 
1990, Science 249:404-406; Cwirla et af., 1990, Proc. Natl. Acad. Sci. USA 
97:6378-6382; Felici, 1991, J. Mol. Biol. 222:301-310; Ladner, supra). 

Where the screening assay is a binding assay, a Gene 216 polypeptide, 

1 0 polynucleotide, analog, or fragment thereof, may be joined to a label, where the 
label can directly or indirectly provide a detectable signal. Various labels 
include radioisotopes, fluorescers, chemiluminescers, enzymes, specific 
binding molecules, particles, e.g. magnetic particles, and the like. Specific 
binding molecules include pairs, such as biotin and streptavidin, digoxin and 

15 antidrgoxin, etc. For the specific binding members, the complementary 
member would normally be labeled with a molecule that provides for detection, 
in accordance with known procedures. 

A variety of other reagents may be included in the screening assay. 
These include reagents like salts, neutral proteins, e.g. albumin, detergents, 

20 etc., that are used to facilitate optimal protein-protein binding and/or reduce 
non-specific or background interactions. Reagents that improve the efficiency 
of the assay, such as protease inhibitors, nuclease inhibitors, anti-microbial 
agents, etc., may be used. The components are added in any order that 
produces the requisite binding. Incubations are performed at any temperature 

25 that facilitates optimal activity, typically between 4° and 40°C. Incubation 
periods are selected for optimum activity, but may also be optimized to 
facilitate rapid high-throughput screening. Normally, between 0.1 and 1 hrwill 
be sufficient. In general, a plurality of assay mixtures is run in parallel with 
different agent concentrations to obtain a differential response to these 

30 concentrations. Typically, one of these concentrations serves as a negative 
control, i.e. at zero concentration or below the level of detection. 

-62- 



WO 01/78894 



PCT/US01/12245 



To perform cell-free J/gand screening assays, it may be desirable to 
immobilize either the Gene 216 polypeptide, polynucleotide, or fragment to a 
surface to facilitate identification of ligands that bind to these molecules, as 
well as to accommodate automation of the assay. For example, a fusion 
5 protein comprising a Gene 216 polypeptide and an affinity tag can be 
produced. In one embodiment, a glutathione-S-transferase/phosphodiesterase 
fusion protein comprising a Gene 216 polypeptide is adsorbed onto glutathione 
sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione-derivatized 
microtiter plates. Cell lysates (e.g., containing 35 S-labeled polypeptides) are 

10 added to the Gene 216-coated beads under conditions to allow complex 
formation (e.g., at physiological conditions for salt and pH). Following 
incubation, the Gene 216-coated beads are washed to remove any unbound 
polypeptides, and the amount of immobilized radiolabel is determined. 
Alternatively, the complex is dissociated and the radiolabel present in the 

15 supernatant is determined. In another approach, the beads are analyzed by 
SDS-PAGE to identify Gene 216-binding polypeptides. 

Ligand-binding assays can be used to identify agonist or antagonists 
that alter the function or levels of the Gene 21 6 polypeptide. Such assays are 
designed to detect the interaction of test agents with Gene 216 polypeptides, 

20 polynucleotides, analogs, or fragments thereof. Interactions may be detected 
by direct measurement of binding. Alternatively, interactions may be detected 
by indirect indicators of binding, such as stabilization/destabilization of protein 
structure, or activation/inhibition of biological function. Non-limiting examples 
of useful ligand-binding assays are detailed below. 

25 Ligands that bind to Gene 216 polypeptides, polynucleotides, analogs, 

or fragments thereof, can be identified using real-time Bimolecular Interaction 
Analysis (BIA; Sjolander et al., 1991 , Anal. Chem. 63:2338-2345; Szabo et al., 
1995, Curr. Opin. Struct. Biol. 5:699-705). BlA-based technology (e.g., 
BIAcore™; LKB Pharmacia, Sweden) allows study of biospecific interactions 

30 in real time, without labeling. In BIA, changes in the optical phenomenon 
surface plasmon resonance (SPR) is used determine real-time interactions of 
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biological molecules. 

Ligands can also be identified by scintillation proximity assays (SPA, 
described in U.S. Patent No. 4,568,649). In a modification of this assay that 
is currently undergoing development, chaperonins are used to distinguish 
5 folded and unfolded proteins. A tagged protein is attached to SPA beads, and 
test agents are added. The bead is then subjected to mild denaturing 
conditions (such as, e.g., heat, exposure to SDS, etc.) and a purified labeled 
chaperonin is added. If a test agent binds to a target, the labeled chaperonin 
will not bind; conversely, if no test agent binds, the protein will undergo some 

1 0 degree of denaturation and the chaperonin will bind. 

Ligands can also be identified using a binding assay based on 
mitochondrial targeting signals (Hurt et al., 1985, EMBO J. 4:2061-2068; Eilers 
and Schatz, 1986, Nature 322:228-231). In a mitochondrial import assay, 
expression vectors are constructed in which nucleic acids encoding particular 

1 5 target proteins are inserted downstream of sequences encoding mitochondrial 
import signals. The chimeric proteins are synthesized and tested for their 
ability to be imported into isolated mitochondria in the absence and presence 
of test compounds. A test compound that binds to the target protein should 
inhibit its uptake into isolated mitochondria in vitro. 

20 The iigand-binding assay described in Fodor et al., 1991, Science 

251:767-773, which involves testing the binding affinity of test compounds for 
a plurality of defined polymers synthesized on a solid substrate, can also be 
used. 

Ligands that bind to Gene 216 polypeptides or peptides can be 
25 identified using two-hybrid assays (see, e.g., U.S. Pat. No. 5,283,317; Zervos 
et al., 1993, Cell 72:223-232; Madura et al., 1993, J. Biol. Chem. 268:12046- 
12054; Bartel et al., 1993, Biotechniques 14:920-924; Iwabuchi et al., 1993, 
Oncogene 8:1693-1696; and Brent WO 94/10300). The two-hybrid system 
relies on the reconstitution of transcription activation activity by association of 
30 the DNA-binding and transcription activation domains of a transcriptional 
activator through protein-protein interaction. The yeast GAL4 transcriptional 
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activator may be used in this way, although other transcription factors have 
been used and are well known in the art. To carryout the two-hybrid assay, the 
GAL4 DNA-binding domain, and the GAL4 transcription activation domain are 
expressed, separately, as fusions to potential interacting polypeptides. 
5 In one embodiment, the "bait" protein comprises a Gene 216 

polypeptide fused to the GAL4 DNA-binding domain. The "fish" protein 
comprises, for example, a human cDNA library encoded polypeptide fused to 
the GAL4 transcription activation domain. If the two, coexpressed fusion 
proteins interact in the nucleus of a host cell, a reporter gene (e.g. LacZ) is 

1 0 activated to produce a detectable phenotype. The host cells that show two- 
hybrid interactions can be used to isolate the containing plasmids containing 
the cDNA library sequences. These plasmids can be analyzed to determine 
the nucleic acid sequence and predicted polypeptide sequence of the 
candidate ligand. Alternatively, methods such as the three-hybrid (Licitra et al., 

15 1996, Proa Natl. Acad. Sci. USA 93:12817-12821), and reverse two-hybrid 
(Vidal et al., 1996, Proa Natl. Acad. Sci. USA 93:10315-10320) systems may 
be used. Commercially available two-hybrid systems such as the CLONTECH 
Matchmaker™ systems and protocols (CLONTECH Laboratories, Inc., Palo 
Alto, CA) may be also be used (see also, A.R. Mendelsohn et al., 1994, Curr. 

20 Op. Biotech. 5:482; E.M. Phizicky et al., 1995, Microbiological Rev. 59:94; M. 
Yang et al., 1995, Nucleic Acids Res. 23:1152; S. Fields et al., 1994, Trends 
Genet. 10:286; and U.S. Patent No. 6,283,173 and 5,468,614). 

Several methods of automated assays have been developed in recent 
years so as to permit screening of tens of thousands of test agents in a short 

25 period of time. High-throughput screening methods are particularly preferred 
for use with the present invention. The ligand-binding assays described herein 
can be adapted for high-throughput screens, or alternative screens may be 
employed. For example, continuous format high throughput screens (CF-HTS) 
using at least one porous matrix allows the researcher to test large numbers 

30 of test agents for a wide range of biological or biochemical activity (see United 
States Patent No. 5,976,813 to Beutel et al.). Moreover, CF-HTS can be used 
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to perform multi-step assays. 
Diagnostics 

As discussed herein, chromosomal region 20p13-p12 has been 
genetically linked to a variety of diseases and disorders, including asthma. 
5 The present invention provides nucleic acids and antibodies that can be 
useful in diagnosing individuals with aberrant Gene 216 expression. In 
particular, the disclosed SNPs can be used to diagnose chromosomal 
abnormalities linked to these diseases. 

Antibody-based diagnostic methods : In a further embodiment of the 

10 present invention, antibodies which specifically bind to the Gene 216 
polypeptide may be used for the diagnosis of conditions or diseases 
characterized by underexpression or overexpression of the Gene 216 
polynucleotide or polypeptide, or in assays to monitor patients being treated 
with a Gene 216 polypeptide or peptide, or a Gene 216 agonist, antagonist, 

15 or inhibitor. 

The antibodies useful for diagnostic purposes may be prepared in the 
same manner as those for use in therapeutic methods, described herein. 
Antibodies may be raised to the full-length Gene 216 polypeptide sequence 
(e.g., SEQ ID NO:4). Alternatively, the antibodies may be raised to 

20 fragments or variants of the Gene 216 polypeptide. In one aspect of the 
invention, antibodies are prepared to bind to a Gene 216 polypeptide 
fragment comprising one or more domains of the Gene 216 polypeptide 
(e.g., pre-, pro-, catalytic, disintegrin, cysteine-rich, EGF, transmembrane, 
and cytoplasmic domains) described herein. 

25 Diagnostic assays for the Gene 216 polypeptide include methods that 

utilize the antibody and a label to detect the protein in biological samples 
(e.g., human body fluids, cells, tissues, or extracts of cells or tissues). The 
antibodies may be used with or without modification, and may be labeled by 
joining them, either covalently or non-covalently, with a reporter molecule. 

30 A wide variety of reporter molecules that are known in the art may be used, 
several of which are described herein. 
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The invention provides methods for detecting disease-associated 
antigenic components in a biological sample, which methods comprise the 
steps of: 1 ) contacting a sample suspected to contain a disease-associated 
antigenic component with an antibody specific for an disease-associated 
5 antigen, extracellular or intracellular, under conditions in which an antigen- 
antibody complex can form between the antibody and disease-associated 
antigenic components in the sample; and 2) detecting any antigen-antibody 
complex formed in step (1 ) using any suitable means known in the art, wherein 
the detection of a complex indicates the presence of disease-associated 
10 antigenic components in the sample. It will be understood that assays that 
utilize antibodies directed against altered Gene 216 amino acid sequences 
(i.e., epitopes encoded by SNPs, mutations, or variants) are within the scope 
of the invention. 

Many immunoassay formats are known in the art, and the particular 

15 format used is determined by the desired application. An immunoassay can 
use, for example, a monoclonal antibody directed against a single disease- 
associated epitope, a combination of monoclonal antibodies directed against 
different epitopes of a single disease-associated antigenic component, 
monoclonal antibodies directed towards epitopes of different disease- 

20 associated antigens, polyclonal antibodies directed towards the same 
disease-associated antigen, or polyclonal antibodies directed towards 
different disease-associated antigens. Protocols can also, for example, use 
solid supports, or may involve immunoprecipitation. 

In accordance with the present invention, "competitive" (U.S. Pat. Nos. 

25 3,654,090 and 3,850,752), "sandwich" (U.S. Pat. No. 4,016,043), and "double 
antibody," or "DASP" assays may be used. Several procedures for measuring 
the Gene 216 polypeptide (e.g., ELISA, RIA, and FACS) are known in the art 
and provide a basis for diagnosing altered or abnormal levels of Gene 216 
polypeptide expression. Normal or standard values for Gene 216 polypeptide 

30 expression are established by incubating biological samples taken from normal 
subjects, preferably human, with antibody to the Gene polypeptide under 
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conditions suitable for complex formation. The amount of standard complex 
formation may be quantified by various methods; photometric means are 
preferred. Levels of the Gene 216 polypeptide expressed in the subject 
sample, negative control (normal) sample, and positive control (disease) 
5 sample are compared with the standard values. Deviation between standard 
and subject values establishes the parameters for diagnosing disease. 

Typically, immunoassays use either a labeled antibody or a labeled 
antigenic component (e.g., that competes with the antigen in the sample for 
binding to the antibody). A number of fluorescent materials are known and 

10 can be utilized as labels for antibodies or polypeptides. These include, for 
example, Cy3, Cy5, Alexa, BODIPY, fluorescein (e.g., FluorX, DTAF, and 
FITC), rhodamine (e.g., TRITC), auramine, Texas Red, AMCA blue, and 
Lucifer Yellow. Antibodies or polypeptides can also be labeled with a 
radioactive element or with an enzyme. Preferred isotopes include 3 H, 14 

15 C, 32 P, 35 S, 36 CI, 51 Cr, 57 Co, 58 Co, 59 Fe, 90 Y, 125 I, 131 J, and 186 Re. 
Preferred enzymes include peroxidase, p-glucuronidase, p-D-g!ucosidase, 
p-D-galactosidase, urease, glucose oxidase plus peroxidase, and alkaline 
phosphatase (see, e.g., U.S. Pat. Nos. 3,654,090; 3,850,752 and 
4,016,043). Enzymes can be conjugated by reaction with bridging 

20 molecules such as carbodiimides, diisocyanates, gfutaraldehyde, and the 
like. Enzyme labels can be detected visually, or measured by calorimetric, 
spectrophotometric, fluorospectrophotometric, amperometric, or gasometric 
techniques. Other labeling systems, such as avidin/biotin, Tyramide Signal 
Amplification (TSA™), are known in the art, and are commercially available 

25 (see, e.g., ABC kit, Vector Laboratories, Inc., Burlingame, CA; NEN® Life 
Science Products, Inc., Boston, MA). 

Kits suitable for antibody-based diagnostic applications typically 
include one or more of the following components: 

(1) Antibodies: The antibodies may be pre-labeled; alternatively, the 

30 antibody may be unlabeled and the ingredients for labeling may be included 
in the kit in separate containers, or a secondary, labeled antibody is 
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provided; and 

(2) Reaction components: The kit may also contain other suitably 
packaged reagents and materials needed for the particular immunoassay 
protocol, including solid-phase matrices, if applicable, and standards. 
5 The kits referred to above may include instructions for conducting the 

test. Furthermore, in preferred embodiments, the diagnostic kits are 
adaptable to high-throughput and/or automated operation. 

Nucleic-acid-based diagnostic methods : The invention provides 
methods for altered levels or sequences of Gene 216 nucleic acids in a 

10 sample, such as in a biological sample, which methods comprise the steps 
of: 1) contacting a sample suspected to contain a disease-associated 
nucleic acid with one or more disease-associated nucleic acid probes under 
conditions in which hybrids can form between any of the probes and 
disease-associated nucleic acid in the sample; and 2) detecting any hybrids 

15 formed in step (1) using any suitable means known in the art, wherein the 
detection of hybrids indicates the presence of the disease-associated 
nucleic acid in the sample. To detect disease-associated nucleic acids 
present in low levels in biological samples, it may be necessary to amplify 
the disease-associated sequences or the hybridization signal as part of the 

20 diagnostic assay. Techniques for amplification are known to those of skill 
in the art. 

The presence of Gene 216 polynucleotide sequences can be 
detected by DNA-DNA or DNA-RNA hybridization, or by amplification using 
probes or primers comprising at least a portion of a Gene 216 

25 polynucleotide, or a sequence complementary thereto. In particular, nucleic 
acid amplification-based assays can use Gene 216 oligonucleotides or 
oligomers to detect transformants containing Gene 216 DNA or RNA. Gene 
216 nucleic acids useful as probes in diagnostic methods include 
oligonucleotides at least 15 nucleotides in length, preferably at least 20 

30 nucleotides in length, and most preferably at least 25-55 nucleotides in 
length, that hybridize specifically with Gene 216 nucleic acids. 
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Several methods can be used to produce specific probes for Gene 216 
polynucleotides. For example, labeled probes can be produced by oligo- 
labeling, nick translation, end-labeling, or PCR amplification using a labeled 
nucleotide. Alternatively, Gene 216 polynucleotide sequences (e.g., SEQ ID 
5 NO:1 or SEQ ID NO:6), or any portions or fragments thereof, may be cloned 
into a vector for the production of an mRNA probe. Such vectors are known 
in the art, are commercially available, and may be used to synthesize RNA 
probes In vitro by addition of an appropriate RNA polymerase, such as T7, T3, 
or SP (6) and labeled nucleotides. These procedures may be conducted using 

10 a variety of commercially available kits (e.g., from Amersham-Pharmacia; 
Promega Corp.; and U.S. Biochemical Corp., Cleveland, OH). Suitable 
reporter molecules or labels which may be used include radionucleotides, 
enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as 
substrates, cofactors, inhibitors, magnetic particles, and the like. 

15 A sample to be analyzed, such as, for example, a tissue sample (e.g., 

hair or buccal cavity) or body fluid sample (e.g., blood or saliva), may be 
contacted directly with the nucleic acid probes. Alternatively, the sample 
may be treated to extract the nucleic acids contained therein. It will be 
understood that the particular method used to extract DNA will depend on 

20 the nature of the biological sample. The resulting nucleic acid from the 
* sample may be subjected to gel electrophoresis or other size separation 
techniques, or, the nucleic acid sample may be immobilized on an 
appropriate solid matrix without size separation. 

Kits suitable for nucleic acid-based diagnostic applications typically 

25 include the following components: 

(1 ) Probe DNA; The probe DNA may be prelabeled; alternatively, 
the probe DNA may be unlabeled and the ingredients for labeling may be 
included in the kit in separate containers; and 

(2) Hybridization reagents: The kit may also contain other suitably 
30 packaged reagents and materials needed for the particular hybridization 

protocol, including solid-phase matrices, if applicable, and standards. 
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in cases where a disease condition is suspected to involve an 
alteration of the Gene 216 nucleotide sequence, specific oligonucleotides 
may be constructed and used to assess the level of disease mRNA in cells 
affected or other tissue affected by the disease. For example, PCR can be 
5 used to test whether a person has a disease-related polymorphism (i.e., 
mutation). 

For PCR analysis, Gene 216 oligonucleotides may be chemically 
synthesized, generated enzymatically, or produced from a recombinant 
source. Oligomers will preferably comprise two nucleotide sequences, one 

10 with a sense orientation (5 ? -> 3') and another with an antisense orientation 
(3* 5 1 ), employed under optimized conditions for identification of a specific 
gene or condition. The same two oligomers, nested sets of oligomers, or 
even a degenerate pool of oligomers may be employed under less stringent 
conditions for detection and/or quantification of closely related DNA or RNA 

15 sequences. 

In accordance with PCR analysis, two oligonucleotides are 
synthesized by standard methods or are obtained from a commercial 
supplier of custom-made oligonucleotides. The length and base 
composition are determined by standard criteria using the Oligo 4.0 primer 

20 Picking program (W. Rychlik, 1992; available from Molecular Biology 
Insights, Inc., Cascade, CO). One of the oligonucleotides is designed so 
that it will hybridize only to the disease gene DNA under the PCR conditions 
used. The other oligonucleotide is designed to hybridize a segment of 
genomic DNA such that amplification of DNA using these oligonucleotide 

25 primers produces a conveniently identified DNA fragment. Samples may be 
obtained from hair follicles, whole blood, or the buccal cavity. The DNA 
fragment generated by this procedure is sequenced by standard techniques. 
In one particular aspect, Gene 216 oligonucleotides can be used to 
perform Genetic Bit Analysis (GBA) of Gene 216 in accordance with 

30 published methods (T.T. Nikiforov et al., 1994, Nucleic Acids Res. 
22(20):41 67-75; T.T. Nikiforov TT et al., 1994, PCR Methods Appi. 3(5):285- 
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91 ). In PCR-based GBA, specific fragments of genomic DNA containing the 
polymorphic site(s) are first amplified by PCR using one unmodified and one 
phosphorothioate-modified primer. The double-stranded PCR product is 
rendered single-stranded and then hybridized to immobilized oligonucleotide 
5 primer in wells of a multi-well plate. The primer is designed to anneal 
immediately adjacent to the polymorphic site of interest. The 3' end of the 
primer is extended using a mixture of individually labeled dideoxynucleoside 
triphosphates. The label on the extended base is then determined. 
Preferably, GBA is performed using semi-automated ELISA or biochip 

10 formats (see, e.g., S.R. Head et al., 1997, Nucleic Acids Res. 25(24):5065- 
71;T.T. Nikiforovetal., 1994, Nucleic Acids Res. 22(20):4 167-75). 

Other amplification techniques besides PCR may be used as 
alternatives, such as ligation-mediated PCR or techniques involving Q-beta 
replicase (Cahill et al., 1991, Clin. Chem., 37(9): 1482-5). Products of 

1 5 amplification can be detected by agarose gel electrophoresis, quantitative 
hybridization, or equivalent techniques for nucleic acid detection known to 
one skilled in the art of molecular biology (Sambrook et al., 1989). Other 
alterations in the disease gene may be diagnosed by the same type of 
amplification-detection procedures, by using oligonucleotides designed to 

20 contain and specifically identify those alterations. 

Gene 216 polynucleotides may also be used to detect and quantify 
levels of Gene 216 mRNA in biological samples in which altered expression of 
Gene 216 polynucleotide may be correlated with disease. These diagnostic 
assays may be used to distinguish between the absence, presence, increase, 

25 and decrease of Gene 216 mRNA levels, and to monitor regulation of Gene 
216 polynucleotide levels during therapeutic treatment or intervention. For 
example, Gene 216 polynucleotide sequences, or fragments, or 
complementary sequences thereof, can be used in Southern or Northern 
analysis, dot blot, or other membrane-based technologies; in PCR 

30 technologies; or in dip stick, pin, ELISA or biochip assays utilizing fluids or 
tissues from patient biopsies to detect the status of, e.g., levels or 
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overexpression of Gene 216, or to detect altered Gene 216 expression. Such 
qualitative or quantitative methods are well known in the art (G.H. Keller and 
M.M. Manak, 1993, DNA Probes, 2 nd Ed, Macmillan Publishers Ltd., England; 
D.W. Dieffenbach and G. S. Dveksler, 1995, PCR Primer: A Laboratory 
5 Manual, Cold Spring Harbor Press, Plainview, NY; B.D. Hames and S.J. 
Higgins, 1985, Gene Probes 1, 2, IRL Press at Oxford University Press, 
Oxford, England). 

Methods suitable for quantifying the expression of Gene 216 include 
radiolabeling or biotinylating nucleotides, co-amplification of a control nucleic 

10 acid, and standard curves onto which the experimental results are interpolated 
(P.C. Melby et al., 1993, J. Immunol Methods 159:235-244; and C. Duplaa et 
a!., 1993, Anal. Biochem. 229-236). The speed of quantifying multiple samples 
may be accelerated by running the assay in an EL ISA format where the 
oligomer of interest is presented in various dilutions and a spectrophotometric 

15 or colorimetric response gives rapid quantification. 

In accordance with these methods, the specificity of the probe, i.e., 
whether it is made from a highly specific region (e.g., at least 8 to 10 or 12 or 
15 contiguous nucleotides fn the 5 1 regulatory region), or a less specific region 
(e.g., especially in the 3' coding region), and the stringency of the hybridization 

20 or amplification (e.g., high, intermediate, or low) will determine whether the 
probe identifies only naturally occurring sequences encoding the Gene 216 
polypeptide, alleles thereof, or related sequences. 

In a particular aspect, a Gene 216 nucleic acid sequence, or a 
sequence complementary thereto, or fragment thereof, may be useful in 

25 assays that detect Gene 21 6-related diseases such as asthma. The Gene 21 6 
polynucleotide can be labeled by standard methods f '1and added to a biological 
sample from a subject under conditions suitable for the formation of 
hybridization complexes. After a suitable incubation period, the sample can be 
washed and the signal is quantified and compared with a standard value. If the 

30 amount of signal in the test sample is significantly altered from that of a 
comparable negative control (normal) sample, the altered levels of Gene 216 
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nucleotide sequence can be correlated with the presence of the associated 
disease. Such assays may also be used to evaluate the efficacy of a particular 
prophylactic or therapeutic regimen in animal studies, in clinical trials, or for an 
individual patient. 

5 To provide a basis for the diagnosis of a disease associated with altered 

expression of Gene 216, a normal or standard profile for expression is 
established. This may be accomplished by incubating biological samples taken 
from normal subjects, either animal or human, with a sequence complementary 
to the Gene 216 polynucleotide, or a fragment thereof, under conditions 

10 suitable for hybridization or amplification. Standard hybridization may be 
quantified by comparing the values obtained from normal subjects with those 
from an experiment where a known amount of a substantially purified 
polynucleotide is used. Standard values obtained from normal samples may 
be compared with values obtained from samples from patients who are 

1 5 symptomatic for the disease. Deviation between standard and subject (patient) 
values is used to establish the presence of the condition. 

Once the disease is diagnosed and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to evaluate whether 
the level of expression in the patient begins to approximate that which is 

20 observed in a normal individual. The results obtained from successive assays 
may be used to show the efficacy of treatment over a period ranging from 
several days to months. 

With respect to diseases such as asthma, the presence of an abnormal 
amount of Gene 216 transcript in a biological sample (e.g., body fluid, cells, 

25 tissues, or cell or tissue extracts) from an individual may indicate a 
predisposition for the development of the disease, or may provide a means for 
detecting the disease prior to the appearance of actual clinical symptoms. A 
more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier, thereby preventing the 

30 development or further progression of the disease. 
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Microarrays : In another embodiment of the present invention, 
oligonucleotides, or longer fragments derived from the Gene 216 
polynucleotide sequence described herein may be used as targets in a 
microarray (e.g., biochip) system. The microarray can be used to monitor the 
5 expression level of large numbers of genes simultaneously (to produce a 
transcript image), and to identify genetic variants, mutations, and 
polymorphisms. This information may be used to determine gene function, to 
understand the genetic basis of a disease, to diagnose disease, and to develop 
and monitor the activities of therapeutic or prophylactic agents. Preparation 

10 and use of microarrays have been described in WO 95/1 1995 to Chee et al.; 
D.J. Lockhart et al., 1996, Nature Biotechnology 14:1675-1680; M. Schena et 
al., 1996, Proc. Natl. Acad. ScL USA 93:10614-10619; U.S. Patent No. 
6,015,702 to P. Lai et al; J. Worley et al., 2000, Microarray Biochip 
Technology, M. Schena, ed., Biotechniques Book, Natick, MA, pp. 65-86; Y.H. 

15 Rogers et al., 1999, Anal. Biochem. 266(1):23-30; S.R. Head et al., 1999, Mol. 
Cell. Probes. 13(2):81-7; S.J. Watson et al., 2000, Biol: Psychiatry 
48(12):1 147-56. 

In one application of the present invention, microarrays containing 
arrays of Gene 216 polynucleotide sequences can be used to measure the 

20 expression levels of Gene 216 in an individual. In particular, to diagnose an 
individual with a Gene 216-related condition or disease, a sample from a 
human or animal (containing nucleic acids, e.g., mRNA) can be used as a 
probe on a biochip containing an array of Gene 216 polynucleotides (e.g., 
DNA) in decreasing concentrations (e.g., 1 ng, 0.1 ng, 0.01 ng, etc.). The test 

25 sample can be compared to samples from diseased and normal samples. 
Biochips Gan also be used to identify Gene 216 mutations or polymorphisms 
in a population, including but not limited to, deletions, insertions, and 
mismatches. For example, mutations can be identified by: 1) placing Gene 
216 polynucleotides of this invention onto a biochip; 2) taking a test sample 

30 (containing, e.g., mRNA) and adding the sample to the biochip; 3) determining 
if the test samples hybridize to the Gene 216 polynucleotides attached to the 
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chip under various hybridization conditions (see, e.g., V.R. Chechetkin et al., 
2000, J. Biomol. Struct. Dyn. 1S(1):83-101). Alternatively microarray 
sequencing can be performed (see, e.g., E.P. Diamandis, 2000, Clin. Chem. 
46(10): 1523-5). 

5 Chromosome mapping : In another application of this invention, the 

Gene 216 nucleic acid sequence, or a complementary sequence, or fragment 
thereof, can be used as probes which are useful for mapping the naturally 
occurring genomic sequence. The sequences may be mapped to a particular 
chromosome, to a specific region of a chromosome, or to human artificial 

10 chromosome constructions (HACs), yeast artificial chromosomes (YACs), 
bacterial artificial chromosomes (BACs), bacterial PI constructions, or single 
chromosome cDNA libraries (see CM. Price, 1993, Blood Rev., 7:127-134 and 
by B.J. Trask, 1991, Trends Genet 7:149-154). 

in another of its aspects, the invention relates to a diagnostic kit for 

1 5 detecting Gene 216 polynucleotide or polypeptide as it relates to a disease or 
susceptibility to a disease, particularly asthma. Also related is a diagnostic kit 
that can be used to detect or assess asthma conditions. Such kits comprise 
one or more of the following: 

(a) a Gene 216 polynucleotide, preferably the nucleotide sequence 
20 of SEQ ID NO:1 or SEQ ID NO:6, or a fragment thereof; or 

(b) a nucleotide sequence complementary to that of (a); or 

(c) a Gene 216 polypeptide, preferably the polypeptide of SEQ ID 
NO:4, or a fragment thereof; or 

(d) an antibody to a Gene 216 polypeptide, preferably to the 
25 polypeptide of SEQ ID NO:4, or an antibody bindable fragment thereof. It will 

be appreciated that in any such kits, (a), (b), (c), or (d) may comprise a 
substantial component and that instructions for use can be included. The kits 
may also contain peripheral reagents such as buffers, stabilizers, etc. 

The present invention also includes a test kit for genetic screening that 
30 can be utilized to identify mutations in Gene 216. By identifying patients with 
mutated Gene 216 DNA and comparing the mutation to a database that 
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contains known mutations in Gene 216 and a particular condition or disease, 
identification and/or confirmation of, a particular condition or disease can be 
made. Accordingly, such a kit would comprise a PCR-based test that would 
involve transcribing the patients mRNA with a specific primer, and amplifying 
5 the resulting cDNA using another set of primers. The amplified product would 
be detectable by gel electrophoresis and could be compared with known 
standards for Gene 216. Preferably, this kit would utilize a patient's blood, 
serum, or saliva sample, and the DNA would be extracted using standard 
techniques. Primers flanking a known mutation would then be used to amplify 

10 a fragment of Gene 216. The amplified piece would then be sequenced to 
determine the presence of a mutation. 

Genomic Screening : The use of polymorphic genetic markers linked to 
the Gene 216 gene is very useful in predicting susceptibility to the diseases 
genetically linked to 20p13-p12. Similarly, the identification of polymorphic 

15 genetic markers within the Gene 216 gene will allow the identification of 
specific allelic variants that are in linkage disequilibrium with other genetic 
lesions that affect one of the disease states discussed herein including 
respiratory disorders, obesity, and inflammatory bowel disease. SSCP (see 
below) allows the identification of polymorphisms within the genomic and 

20 coding region of the disclosed gene. The present invention provides 
sequences for primers that can be used identify exons that contain SNPs, as 
well as sequences for primers that can be used to identify the sequence 
change. This information can be used to identify additional SNPs in 
accordance with the methods disclosed herein. Suitable methods for genomic 

25 screening have also been described by, e.g., Sheffield et a!., 1995, Genet, 
4:1837-1844; LeBfanc-Straceski et al., 1994, Genomics, 19:341-9; Chen et af., 
1995, Genomics, 25:1-8. In employing these methods, the disclosed reagents 
can be used to predict the risk for disease (e.g., respiratory disorders, obesity, 
and inflammatory bowel disease) in a population or individual. 

30 Therapeutics 

The present invention provides methods of screening for drugs 
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comprising contacting such an agent with a novel protein of this invention or 
fragment thereof and assaying 1) for the presence of a complex between the 
agent and the protein or fragment, or 2) for the presence of a complex 
between the protein or fragment and a iigand, by methods well known in the 
5 art. In such competitive binding assays the novel protein or fragment is 
typically labeled. Free protein or fragment is separated from that present in 
a protein: protein complex, and the amount of free (i.e., uncomplexed) label 
is a measure of the binding of the agent being tested to Gene 216 protein 
or its interference with protein ligand binding, respectively. 

10 This invention also contemplates the use of competitive drug 

screening assays in which neutralizing antibodies capable of specifically 
binding the Gene 21 6 protein compete with a test compound for binding to 
the Gene 216 protein or fragments thereof. In this manner, the antibodies 
can be used to detect the presence of any peptide that shares one or more 

15 antigenic determinants of a Gene 216 protein. 

The goal of rational drug design is to produce structural analogs of 
biologically active proteins of interest or of small molecules with which they 
interact (e.g., agonists, antagonists, inhibitors) in order to fashion drugs 
which are, for example, more active or stable forms of the protein, or which, 

20 e.g., enhance or interfere with the function of a protein in vivo (see, e.g., 
Hodgson, 1991, Bio/Technology, 9:19-21). In one approach, one first 
determines the three-dimensional structure of a protein of interest or, for 
example, of the Gene 216 receptor or ligand complex, by x-ray 
crystallography, by computer modeling or most typically, by a combination 

25 of approaches. Less often, useful information regarding the structure of a 
protein may be gained by modeling based on the structure of homologous 
proteins. An example of rational drug design is the development of HIV 
protease inhibitors (Erickson et al., 1990, Science, 249:527-533). In 
addition, peptides (e.g., Gene 216 protein) are analyzed by an alanine scan 

30 (Weils, 1991, Methods in Enzymol., 202:390-411). In this technique, an 
amino acid residue is replaced by Ala, and its effect on the peptides activity 
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is determined. Each of the amino acid residues of the peptide is analyzed 
in this manner to determine the important regions of the peptide. 

It is also possible to isolate a target-specific antibody, selected by a 
functional assay, and then to solve its crystal structure. In principle, this 
5 approach yields a pharmacore upon which subsequent drug design can be 
based. It is possible to bypass protein crystallography altogether by 
generating anti-idiotypic antibodies (anti-ids) to a functional, 
pharmacologically active antibody. As a mirror image of a mirror image, the 
binding site of the anti-ids would be expected to be an analog of the original 

10 Gene 216 protein. The anti-id could then be used to identify and isolate 
peptides from banks of chemically or biologically produced banks of 
peptides. Selected peptides would then act as the pharmacore. 

Thus, one may design drugs which result in, for example, altered 
Gene 216 protein activity or stability or which act as inhibitors, agonists, 

1 5 antagonists, etc. of Gene 216 protein activity. By virtue of the availability of 
cloned Gene 216 gene sequences, sufficient amounts of the Gene 216 
protein may be made available to perform such analytical studies as x-ray 
crystallography- In addition, the knowledge of the Gene 216 polypeptide 
sequence will guide those employing computer-modeling techniques in place 

20 of, or in addition to x-ray crystallography. 

In another aspect of the present invention, cells and animals that 
carry the Gene 216 gene or an analog thereof can be used as model 
systems to study and test for substances that have potential as therapeutic 
agents. After a test substance is administered to animals or applied to the 

25 cells, the phenotype of the animals/cells can be determined. 

In yet another aspect of this invention, antibodies that specifically react 
with Gene 216 polypeptide of peptides derived therefrom can be used as 
therapeutics. In particular, anti-Gene 216 antibodies can be used to block the 
Gene 216 activity. Anti-Gene 216 antibodies or fragments thereof can be 

30 formulated as pharmaceutical compositions and administered to a subject. It 
is noted that antibody-based therapeutics produced from non-human sources 



-79- 



WO 01/78894 



PCT/US01/12245 



can cause an undesired immune response in human subjects. To minimize 
this problem, chimeric antibody derivatives can be produced. Chimeric 
antibodies combine a non-human animal variable region with a human 
constant region. Chimeric antibodies can be constructed according to methods 
5 known in the art (see Morrison et al., 1985, Proc. Natl. Acad. Sci. USA, 
81:6851; Takeda et al., 1985, Nature 314:452; U.S. Patent No. 4,816,567 of 
Cabiliy et al.; U.S. Patent No. 4,816,397 of Boss et al.; European Patent 
Publication EP 171496; EP 0173494; United Kingdom Patent GB 2177096B). 
In addition, antibodies can be further "humanized" by any of the techniques 

10 known in the art, (e.g., Teng et ah, 1983, Proc. Natl. Acad. Sci. USA 80:7308- 
7312; Kozbor et al., 1983, Immunology Today 4: 7279; Olsson et al., 1982, 
Meth. Enzymol. 92:3-16; International Patent Application WO92/06193; EP 
0239400). Humanized antibodies can also be obtained from commercial 
sources (e.g., Scotgen Limited, Middlesex, Great Britain). Immunotherapy with 

15 a humanized antibody may result in increased long-term effectiveness for the 
treatment of chronic disease situations or situations requiring repeated 
antibody treatments. 

In one embodiment, compositions (e.g., pharmaceutical compositions) 
for use with the present invention comprise metalloprotease inhibitors, or 

20 analogs or derivatives thereof. Non-limiting examples of metalloprotease 
inhibitors include: 1) naturally occurring inhibitors, e.g., oprin (J.J. Catanese 
and L.F. Kress, 1992, Biochemistry 31:410-418; HSF (Y. Yamakawa and T. 
Ornori-Satoh, 1992, J. Biochem. 112:583-589); erinacin (D. Mebs et al., 1996, 
Toxicon 34:1313-1316; Ornori-Satoh et al., 2000, Toxicon 38:1561-1580); 

25 DM40 and DM43 (A.G. Neves-Ferreira et al., 2000, Biochem. Biophys. Acta. 
1473:309-320); citrate (B. Francis et al., 1992, Toxicon 30:1239-1246); T1MP-1 
and TIMP-2 (R.V. Ward et al., 1991, Biochem J. 278, Pt 1:179-873); 
pyrophosphate (G.S. Makowski and M.L. Ramsby, 1999, \nflammation 23:333- 
360); proglutamyl peptides such as pyroGlu-Asn-Trp-OH and pyroGlu-Glu-Trp- 

30 OH (A. Robeva et al., 1991, Biomed. Biochem. Acta. 50:769-773); 2) peptide 
analogs and derivatives, e.g., 2-distereomeric furan-2-carbonylamino-3- 
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oxohexahydroindoijzino[8,7-b]indole carboxylates (S. D'Alessio et al., 2001, 
Eur. J. Med. Chem. 36:43-53); phosphonate and carboxylate derivatives of 
pyroGlu-Asn-Trp-OH (D'Alessio et al., 2001); POL 647 and POL 656 (F.X. 
Gomis-Ruth et al., 1998, Prof. Sci. 7:283-292); cysteine-switches (K. Nomura 
5 and N. Suzuki, 1993, FEBS Lett. 321:84-88); 3) hydroxamate compounds, e.g., 
batimastat/BB-94 (see, e.g., G.F. Beattie et al., 1998, Clin. Cancer Res. 
8:1899-1902); prinomastat/AG3340 (see, e.g., R. Scatena, 2000, Expert Opin. 
Investig. Drugs 9:2159-2165); and 4) other inhibitors, e.g., ortho-substituted 
macrocyclic lactams (G.M. Ksander, 1997, J. Med. Chem. 40:495-505); 

10 diketopiperazine (DKP) (A.K. Szardenings et al., 1998, J. Med. Chem. 
41 (13):2 194-200; alendronate/PCP (Makowski and Ramsby, 1999); and 
CT1746 (Z. An et al., 1997, Clin. Exp. Metastasis 15:184-195). 

In particular, the determined structures of metalloproteases and 
metal lop rotease inhibitors can be used to devise Gene 216-targeted inhibitors 

15 (i.e., by rational drug design; see Szardenings et al, 1998). Structural 
information can be found in, e.g., C. Oefner et al., 2000, J. MoL Biol. 
296(2):341-9; B. Wu et al., 2000, J. MoL Biol. 295(2):257~68; L. Chen et a)., 
1999, J. Mol. Biol. 293(3):545-57; C. Fernandez-Catalanet al., 1998, EMBOJ. 
17(17):5238-48; S. Arumugam et al., 1998, Biochemistry 37(27):9650-7; 

20 Gohlke et al., 1996, FEBS Lett. 378:126-130; Gomis-Ruth et al., 1998; F.X. 
Gomis-Ruth etal, 1993, EMBOJ. 12:4151-4157; F.X. Gomis-Ruth etal, 1996, 
J. Mol. Biol. 264:556-566; K. Maskos et al., 1998, Proc. Natl. Acad. Sci. USA 
95(7):3408-12; F.X. Gomis-Ruth et al, 1997, Nature 389:77-80; M. Betz et al., 
1997, Eur. J. Biochem. 247(1 ):356-63; B. Lovejoy et al., 1994, Biochemistry 

25 33(27):8207-17. Structures of zinc metalloproteases are also found in 
Molecular Modeling DataBase (MMDB) at the NCBI web site 
http.7/www.ncbi.nlm.nih.gov:80/Structure/MMDB/mmdb.shtml (e.g. Accession 
Nos. 1D5J, 1D8F, 1D7X, 1BSK, 2TLX, 1TLX, 1BUD, 1BSW, 1UEA, 4AIG, 
3AIG, 2AfG, 1KUH, 1DTH, 1UMS, 1UMT, 7TLN, 6TMN, 5TMN, 5TLN, 4TMN, 

30 4TLN, 3TMN, 2TMN, 1TMN, 1TLP, 1IAG, 1HYT, 1AST, 8TLN, 1THL). In an 
alternative approach, the binding specificity of TIMP proteins can be 
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engineered to produce inhibitors that specifically inactivate Gene 216 
polypeptide (see, e.g., H. Nagase et al., 1999, Ann. NY Acad. Sci. 878:1-11; 
G.S. Butler et al., 1999, J. Biol. Chem. 274(29):20391 -20396). 

In another embodiment of the present invention, compositions (e.g., 
5 pharmaceutical compositions) for use with the present invention comprise 
disintegrin agonists, or analogs or derivatives thereof. The determined 
structures of disintegrin proteins and domains can be used to devise Gene 216 
disintegrin-targeted agonists (i.e., by rational drug design). Such structural 
information can be found in R.A. Atkinson et al., 1994, Int. J. Pept. Protein Res. 

10 43:563-72; V. Saudek et al., 1991 , Eur. J. Biochem. 202:329-38; H. Minoux et 
al., 2000, J. Comput. Aided Mol. Des. 14:317-27. 

The present invention contemplates compositions comprising a Gene 
216 polynucleotide, polypeptide, antibody, ligand (e.g., agonist, antagonist, or 
inhibitor), or fragments, variants, or anafogs thereof, and a physiologically 

15 acceptable carrier, excipient, or diluent as described in detail herein. The 
present invention further contemplates pharmaceutical compositions useful in 
practicing the therapeutic methods of this invention. Preferably, a 
pharmaceutical composition includes, in admixture, a pharmaceutical^ 
acceptable excipient (carrier) and one or more of a Gene 216 polypeptide, 

20 polynucleotide, ligand, antibody, or fragment or variant thereof, as described 
herein, as an active ingredient. The preparation of pharmaceutical 
compositions that contain Gene 216-related reagents as active ingredients is 
well understood in the art. Typically, such compositions are prepared as 
injectables, either as liquid solutions or suspensions, however, solid forms 

25 suitable for solution in, or suspension in, liquid prior to injection can also be 
prepared. The preparation can also be emulsified. The active therapeutic 
ingredient is often mixed with excipients that are pharmaceutical^ acceptable 
and compatible with the active ingredient. Suitable excipients are, for example, 
water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. 

30 In addition, if desired, the composition can contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH-buffering agents, which 
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enhance the effectiveness of the active ingredient. 

A Gene 216 polypeptide, polynucleotide, figand, antibody, or variant or 

fragment thereof can be formulated into the pharmaceutical composition as 

neutralized physiologically acceptable salt forms. Suitable salts include the 
5 acid addition salts (i.e., formed with the free amino groups of the polypeptide 

or antibody molecule) and which are formed with inorganic acids such as, for 

example, hydrochloric or phosphoric acids, or such organic acids as acetic, 

oxalic, tartaric, mandelic, and the like. Salts formed from the free carboxyl 

groups can also be derived from inorganic bases such as, for example, 
1 0 sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic 

bases as isopropylamine, trim ethyla mine, 2-ethylamino ethanol, histidine, 

procaine, and the like. 

The pharmaceutical compositions can be administered systemically by 

oral or parenteral routes. Non-limiting parenteral routes of administration 
15 include subcutaneous, intramuscular, intraperitoneal, intravenous, transdermal, 

inhalation, intranasal, intra-arterial, intrathecal, enteral, sublingual, or rectal. 
Intravenous administration, for example, can be performed by injection of a 

unit dose. The term "unit dose" when used in reference to a pharmaceutical 

composition of the present invention refers to physically discrete units suitable 
20 as unitary dosage for humans, each unit containing a predetermined quantity 

of active materia) calculated to produce the desired therapeutic effect in 

association with the required diluent; i.e., carrier, or vehicle. 

In one particular embodiment of the present invention, the disclosed 

pharmaceutical compositions are administered via mucoactive aerosol therapy 
25 (see, e.g., M. Fuloria and B.K. Rubin, 2000, Respir. Care 45:868-873; I. 

Gonda, 2000, J. Pharm. Sci. 89:940-945; R. Dhandf2000, Curr. Opin. Pulm. 

Med, 6(1):59-70; B.K. Rubin, 2000, Respir. Care 45(6):684-94; S. Suarez and 

A.J. Hickey, 2000, Respir. Care. 45(6):652-66). 

Pharmaceutical compositions are administered in a manner compatible 
30 with the dosage formulation, and in a therapeutically effective amount. The 

quantity to be administered depends on the subject to be treated, capacity of 
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the subject's immune system to utilize the active ingredient, and degree of 
modulation of Gene 216 activity desired. Precise amounts of active ingredient 
required to be administered depend on the judgment of the practitioner and are 
specific for each individual. However, suitable dosages may range from about 
5 0.1 to 20, preferably about 0.5 to about 10, and more preferably one to several, 
milligrams of active ingredient per kilogram body weight of individual per day 
and depend on the route of administration. Suitable regimes for initial 
administration and booster shots are also variable, but are typified by an initial 
administration followed by repeated doses at one or more hour intervals by a 

10 subsequent injection or other administration. Alternatively, continuous 
intravenous infusions sufficient to maintain concentrations of 10 nM to 1 0 jJVI 
in the blood are contemplated. An exemplary pharmaceutical formulation 
comprises: Gene 21 6 antagonist or inhibitor (5.0 mg/ml); sodium bisulfite USP 
(3.2 mg/ml); disodium edetate USP (0.1 mg/ml); and water for injection q.s.a.d. 

15 (1 .0 ml). As used herein, "pg" means picogram, "ng" means nanogram, >g n 
means microgram, "mg" means milligram, "p.!" means microliter, "ml" means 
milliliter, and T means L. 

For further guidance in preparing pharmaceutical formulations, see, 
e.g., Gilman et at. (eds), 1990, Goodman and Gilman's: The Pharmacological 

20 Basis of Therapeutics, 8th ed., Pergamon Press; and Remington's 
Pharmaceutical Sciences, 17th ed., 1990, Mack Publishing Co., Easton, PA; 
Avis et a I. (eds), 1993, Pharmaceutical Dosage Forms: Parenteral 
Medications, Dekker, New York; Lieberman et al. (eds), 1990, Pharmaceutical 
Dosage Forms: Disperse Systems, Dekker, New York. 

25 Pharmacogenetics : The Gene 216 polypeptides and polynucleotides 

are also useful in pharmacogenetic analysis (i.e., the study of the relationship 
between an individual's genotype and that individual's response to a 
therapeutic composition or drug). See, e.g., M. Eichelbaum, 1996, Clin. Exp, 
Pharmacol. Physiol. 23(1 0-1 1):983-985, and M.W. Under, 1997, Clin. Chem. 

30 43(2):254-266. The genotype of the individual can determine the way a 
therapeutic acts on the body or the way the body metabolizes the therapeutic. 
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Further, the activity of drug metabolizing enzymes affects both the intensity 
and duration of therapeutic activity. Differences in the activity or metabolism 
of therapeutics can lead to severe toxicity or therapeutic failure. Accordingly, 
a physician or clinician may consider applying knowledge obtained in relevant 
5 pharmacogenetic studies in determining whether to administer a Gene 216 
polypeptide, polynucleotide, analog, antagonist, inhibitor, or modulator, as well 
as tailoring the dosage and/or therapeutic or prophylactic treatment regimen. 

In general, two types of pharmacogenetic conditions can be 
differentiated. Genetic conditions can be due to a single factor that alters the 

10 way the drug act on the body (altered drug action), or a factor that alters the 
way the body metabolizes the drug (altered drug metabolism). These 
conditions can occur either as rare genetic defects or as naturally-occurring 
polymorphisms. For example, glucose-6-phosphate dehydrogenase deficiency 
(G6PD) is a common inherited enzymopathy which results in haemolysis after 

15 ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

The discovery of genetic polymorphisms of drug metabolizing enzymes 
(e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 enzymes CYP2D6 
and CYP2C19) has provided an explanation as to why some patients do not 

20 obtain the expected drug effects or show exaggerated drug response and 
serious toxicity after taking the standard and safe dose of a drug. These 
polymorphisms are expressed in two phenotypes in the population, the 
extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM 
is different among different populations. The gene coding for CYP2D6 is highly 

25 polymorphic and several mutations have been identified in PM, which all lead 
to the absence of functional CYP2D6. Poor metabolizers quite frequently 
experience exaggerated drug response and side effects when they receive 
standard doses. If a metabolite is the active therapeutic moiety, PM show no 
therapeutic response. This has been demonstrated for the analgesic effect of 

30 codeine mediated by its CYP2D6-formed metabolite morphine. At the other 
extreme, ultra-rapid metabolizers fail to respond to standard doses. Recent 



-85- 



WO 01/78894 PCT/US01/12245 



studies have determined that ultra-rapid metabolism is attributable to CYP2D6 
gene amplification. 

By analogy, genetic polymorphism or mutation may lead to allelic 
variants of Gene 21 6 in the population which have different levels of activity. 
5 The Gene 216 polypeptides or polynucleotides thereby allow a clinician to 
ascertain a genetic predisposition that can affect treatment modality. In 
addition, genetic mutation or variants at other genes may potentiate or diminish 
the activity of Gene 216-targeted drugs. Thus, in a Gene 216-based treatment, 
polymorphism or mutation may give rise to individuals that are more or less 

10 responsive to treatment. Accordingly, dosage would necessarily be modified 
to maximize the therapeutic effect within a given population containing the 
polymorphism. As an alternative to genotyping, specific polymorphic 
polypeptides or polynucleotides can be identified. 

To identify genes that modify Gene 216-targeted drug response, several 

15 pharmacogenetic methods can be used. One pharmacogenomics approach, 
"genome-wide association", relies primarily on a high-resolution map of the 
human genome. This high-resolution map shows previously identified gene- 
related markers (e.g., a "bi-allelic" gene marker map which consists of 60,000- 
100,000 polymorphic or variable sites on the human genome, each of which 

20 has two variants). A high-resolution genetic map can then be compared to a 
map of the genome of each of a statistically significant number of patients 
taking part in a Phase I l/l II drug trial to identify markers associated with a 
particular observed drug response or side effect. Alternatively, a high- 
resolution map can be generated from a combination of some ten million 

25 known single nucleotide polymorphisms (SNPs) in the human genome. Given 
a genetic map based on the occurrence of such SNPs, individuals can be 
grouped into genetic categories depending on a particular pattern of SNPs in 
their individual genome. In this way, treatment regimens can be tailored to 
groups of genetically similar individuals, taking into account traits that may be 

30 common among such genetically similar individuals (see, e.g., D.R. Pfost et al. f 
2000, Trends Biotechnol. 18(8):334-8). 
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As another example, the "candidate gene approach", can be used. 
According to this method, if a gene that encodes a drug target is known, all 
common variants of that gene can be fairly easily identified in the population 
and it can be determined if having one version of the gene versus another is 
5 associated with a particular drug response. 

As yet another example, a "gene expression profiling approach", can be 
used. This method involves testing the gene expression of an animal treated 
with a drug (e.g., a Gene 216 polypeptide, polynucleotide, analog, or 
modulator) to determine whether gene pathways related to toxicity have been 
10 turned on. 

Information obtained from one of the approaches described herein can 
be used to establish a pharmacogenetic profile, which can be used to 
determine appropriate dosage and treatment regimens for prophylactic or 
therapeutic treatment an individual. A pharmacogenetic profile, when applied 

15 to dosing or drug selection, can be used to avoid adverse reactions or 
therapeutic failure and thus enhance therapeutic or prophylactic efficiency 
when treating a subject with a Gene 216 polypeptide, polynucleotide, analog, 
antagonist, inhibitor, or modulator. 

Gene 216 polypeptides or polynucleotides are also useful for monitoring 

20 therapeutic effects during clinical trials and other treatment. Thus, the 
therapeutic effectiveness of an agent that is designed to increase or decrease 
gene expression, polypeptide levels, or activity can be monitored over the 
course of treatment using the Gene 216 compositions or modulators. For 
example, monitoring can be performed by: 1) obtaining a p re-ad ministration 

25 sample from a subject prior to administration of the agent; 2) detecting the level 
of expression or activity of the protein in the pre-ad ministration sample; 3) 
obtaining one or more post-administration samples from the subject; 4) 
detecting the level of expression or activity of the polypeptide in the post- 
administration samples; 5) comparing the level of expression or activity of the 

30 polypeptide in the pre-ad ministration sample with the polypeptide in the post- 
ad ministration sample or samples; and 6) increasing or decreasing the 
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administration of the agent to the subject accordingly. 

Gene Therapy : In recent years, significant technological advances have 
been made in the area of gene therapy for both genetic and acquired diseases 
(Kay et a!., 1997, Proc. Natl. Acad. Sci. USA, 94:12744-12746). Gene therapy 
5 can be defined as the transfer of DNA for therapeutic purposes. Improvement 
in gene transfer methods has allowed for development of gene therapy 
protocols for the treatment of diverse types of diseases. Gene therapy has 
also taken advantage of recent advances in the identification of new 
therapeutic genes, improvement in both viral and non-viral gene delivery 
1 0 systems, better understanding of gene regulation, and improvement in cell 
isolation and transplantation. Gene therapy would be carried out according to 
generally accepted methods as described by, for example, Friedman, 1991, 
Therapy for Genetic Diseases, Friedman, Ed., Oxford University Press, pages 
105-121. 

15 Vectors for introduction of genes both for recombination and for 

extrachromosoma! maintenance are known in the art, and any suitable 
vector may be used. Methods for introducing DNA into cells such as 
electro poration, calcium phosphate co-precipitation, and viral transduction 
are known in the art, and the choice of method is within the competence of 

20 one skilled in the art (Robbins (ed), 1997, Gene Therapy Protocols, Human 
Press, NJ). Cells transformed with a Gene 216 gene can be used as model 
systems to study chromosome 20 disorders and to identify drug treatments 
for the treatment of such disorders. 

Gene transfer systems known in the art may be useful in the practice 

25 of the gene therapy methods of the present invention. These include viral 
and non-viral transfer methods. A number of viruses have been used as 
gene transfer vectors, including polyoma, i.e., SV40 (Madzak et al., 1992, 
J. Gen. Virol., 73:1533-1536), adenovirus (Berkner, 1992, Curr. Top. 
Microbiol. Immunol., 158:39-6; Berkner et al., 1988, Bio Techniques, 6:616- 

30 629; Gorziglia et al., 1992, J. Virol., 66:4407-4412; Quantin et al., 1992, 
Proc. Natl. Acad. Sci. USA, 89:2581-2584; Rosenfeld et al., 1992, Cell, 
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68:143-155; Wilkinson et al., 1992, Nuci. Acids Res., 20:2233-2239; 
Stratford-Perricaudet et aL, 1990, Hum. Gene Ther., 1:241-256), vaccinia 
virus (Mackett et a!., 1992, Biotechnology, 24:495- 499), adeno-associated 
virus (Muzyczka, 1992, Curr. Top. Microbiol. Immunol., 158:91- 123; Ohi et 
5 al., 1990, Gene, 89:279-282), herpes viruses including HSV and EBV 
(Margolskee, 1992, Curr. Top. Microbiol. Immunol., 158:67-90; Johnson et 
al., 1992, J. Virol., 66:2952-2965; Finketal., 1992, Hum. Gene Ther., 3:11- 
19; Breakfieid et al., 1987, Mol. Neurobiol., 1:337-371; Fresse et al., 1990, 
Biochem. Pharmacol., 40:2189-2199), and retroviruses of avian 

10 (Brandyopadhyay et al., 1984, Mol. Cell Biol., 4:749-754; Petropouplos et 
al., 1992, J. Virol., 66:3391-3397), murine (Miller, 1992, Curr. Top. Microbiol. 
Immunol., 158:1-24; Miller et al., 1985, Mol. Cell Biol., 5:431-437; Sorge et 
al., 1984, Mol Celi Biol., 4:1730-1737; Mann et al., 1985, J. Virol., 54:401- 
407), and human origin (Page et al., 1990, J. Virol., 64:5370-5276; 

15 Buchschalcher et al., 1992, J. Virol., 66:2731-2739). Most human gene 
therapy protocols have been based on disabled murine retroviruses. 

Non-viral gene transfer methods known in the art include chemical 
techniques such as calcium phosphate coprecipitation (Graham et al., 1973, 
Virology, 52:456-467; Pellicer et al., 1980, Science, 209:1414-1422), 

20 mechanical techniques, for example microinjection (Anderson et al., 1980, 
Proc. Natl. Acad. Sci. USA, 77:5399-5403; Gordon et al., 1980, Proc. Natl. 
Acad. Sci USA, 77:7380-7384; Brinster et al., 1981, Cell, 27:223-231; 
Constantini et al., 1981, Nature, 294:92-94), membrane fusion-mediated 
transfer via liposomes (Feigner et al., 1987, Proc. Natl. Acad. Sci. USA, 

25 84:7413-7417; Wang et al., 1989, Biochemistry, 28:^508-9514; Kaneda et 
al., 1989, J. Biol. Chem., 264:12126-12129; Stewart et al., 1992, Hum. Gene 
Ther., 3:267-275; Nabel et al., 1990, Science, 249:1285-1288; Lim et al., 
1992, Circulation, 83:2007-2011), and direct DNA uptake and receptor- 
mediated DNA transfer (Wolff et al., 1990, Science, 247:1465-1468; Wu et 

30 al., 1991, BioTechniques, 11:474-485; Zenke et al., 1990, Proc. Natl. Acad. 
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Sci. USA, 87:3655-3659; Wu et al., 1989, J. Biol. Chem., 264:16985-16987; 
Wolff etaL, 1991 , BioTechniques, 11:474-485; Wagner et al., 1991, Proc. 
Natl. Acad. Sci. USA, 88:4255-4259; Cotten et al., 1990, Proc. Natl. Acad. 
Sci. USA, 87:4033-4037; Curiel et al., 1991, Proc. Natl. Acad. Sci. USA, 
5 88:8850-8854; Curiel et al., 1991, Hum. Gene Ther., 3:147-154). 

In one approach, plasmid DNA is complexed with a polylysine- 
conjugated antibody specific to the adenovirus hexon protein, and the 
resulting complex is bound to an adenovirus vector. The trimolecular 
complex is then used to infect cells. The adenovirus vector permits efficient 

10 binding, internalization, and degradation of the endosome before the 
coupled DNA is damaged. 

In another approach, liposome/DNA is used to mediate direct in vivo 
gene transfer. While in standard liposome preparations the gene transfer 
process is non-specific, localized in vivo uptake and expression have been 

1 5 reported in tumor deposits, for example, following direct in situ administration 
(Nabel, 1992, Hum. Gene Then, 3:399-410). 

Suitable gene transfer vectors possess a promoter sequence, preferably 
a promoter that is cell-specific and placed upstream of the sequence to be 
expressed. The vectors may also contain, optionally, one or more expressible 

20 marker genes for expression as an indication of successful transfection and 
expression of the nucleic acid sequences contained in the vector. In addition, 
vectors can be optimized to minimize undesired immunogenicity and maximize 
long-term expression of the desired gene product(s) (see Nabe, 1999, Proc. 
Natl. Acad. Sci. USA 96:324-326). Moreover, vectors can be chosen based on 

25 cell-type that is targeted for treatment. Notably, gene transfer therapies have 
been initiated for the treatment of various pulmonary diseases (see, e.g., M.J. 
Welsh, 1999, J. Clin. Invest 1 04(9): 1 165-6; D.L. Ennist, 1999, Trends 
Pharmacol. Sci. 20:260-266; S.M. Albelda et al., 2000, Ann. Intern. Med. 
132:649-660; E. Alton and C. Kitson C, 2000, Expert Opin. Investig. Drugs. 

30 9(7):1 523-35). 

Illustrative examples of vehicles or vector constructs for transfection or 
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infection of the host cells include replication-defective viral vectors, DNA virus 
or RNA virus (retrovirus) vectors, such as adenovirus, herpes simplex virus and 
adeno-associated viral vectors. Adeno-associated virus vectors are single 
stranded and allow the efficient delivery of multiple copies of nucleic acid to the 
5 cell's nucleus. Preferred are adenovirus vectors. The vectors will normally be 
substantially free of any prokaryotic DNA and may comprise a number of 
different functional nucleic acid sequences. An example of such functional 
sequences may be a DNA region comprising transcriptional and translational 
initiation and termination regulatory sequences, including promoters (e.g., 

10 strong promoters, inducible promoters, and the like) and enhancers which are 
active in the host cells. Also included as part of the functional sequences is an 
open reading frame (polynucleotide sequence) encoding a protein of interest 
Flanking sequences may also be included for site-directed integration. In 
some situations, the 5'-flanking sequence will allow homologous 

15 recombination, thus changing the nature of the transcriptional initiation region, 
so as to provide for inducible or non-inducible transcription to increase or 
decrease the level of transcription, as an example. 

In general, the encoded and expressed Gene 216 polypeptide may be 
intracellular, i.e., retained in the cytoplasm, nucleus, or in an organelle, or may 

20 be secreted by the cell. For secretion, the natural signal sequence present in 
Gene 216 may be retained. When the polypeptide or peptide is a fragment of 
a Gene 216 protein, a signal sequence may be provided so that, upon 
secretion and processing at the processing site, the desired protein will have 
the natural sequence. Specific examples of coding sequences of interest for 

25 use in accordance with the present invention include the Gene polypeptide 
coding sequences, e.g., SEQ ID NO:4. 

As previously mentioned, a marker may be present for selection of cells 
containing the vector construct. The marker may be an inducible or non- 
inducible gene and will generally allow for positive selection under induction, 

30 or without induction, respectively. Examples of marker genes include 
neomycin, dihydrofolate reductase, glutamine synthetase, and the like. The 



-91 - 



WO 01/78894 



PCT/US01/12245 



vector employed will generally also include an origin of replication and other 
genes that are necessary for replication in the host cells, as routinely employed 
by those having skill in the art. As an example, the replication system 
comprising the origin of replication and any proteins associated with replication 
5 encoded by a particular virus may be included as part of the construct The 
replication system must be selected so that the genes encoding products 
necessary for replication do not ultimately transform the cells. Such replication 
systems are represented by replication-defective adenovirus (see G. Acsadi et 
al., 1994, Hum. MoL Genet 3:579-584) and by Epstein-Barr virus. Examples 

10 of replication defective vectors, particularly, retroviral vectors that are 
replication defective, are BAG, (see Price et al., 1987, Proc. Natl. Acad. Sci. 
USA, 84:156; Sanes et al., 1986, EMBO J., 5:3133). It will be understood that 
the final gene construct may contain one or more genes of interest, for 
example, a gene encoding a bioactive metabolic molecule. In addition, cDNA, 

1 5 synthetically produced DNA or chromosomal DNA may be employed utilizing 
methods and protocols known and practiced by those having skill in the art. 

According to one approach for gene therapy, a vector encoding a Gene 
216 polypeptide is directly injected into the recipient cells (in vivo gene 
therapy). Alternatively, cells from the intended recipients are explanted, 

20 genetically modified to encode a Gene 216 polypeptide, and reimplanted into 
the donor (ex vivo gene therapy). An ex vivo approach provides the advantage 
of efficient viral gene transfer, which is superior to in vivo gene transfer 
approaches. In accordance with ex vivo gene therapy, the host cells are first 
transfected with engineered vectors containing at least one gene encoding a 

25 Gene 216 polypeptide, suspended in a physiologically acceptable carrier or 
excipient such as saline or phosphate buffered saline, and the like, and then 
administered to the host. The desired gene product is expressed by the 
injected cells, which thus introduce the gene product into the host. The 
introduced gene products can thereby be utilized to treat or ameliorate a 

30 disorder that is related to altered levels of Gene 21 6 (e.g., asthma). 
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Animal Models 

Gene 216 polynucleotides can be used to generate genetically altered 
non-human animals or human cell lines. Any non-human animal can be used; 
however typical animals are rodents, such as mice, rats, or guinea pigs. 
5 Genetically engineered animals or cell lines can carry a gene that has been 
altered to contain deletions, substitutions, insertions, or modifications of the 
polynucleotide sequence (e.g., exon sequence). Such alterations may render 
the gene nonfunctional, (i.e., a null mutation) producing a "knockout" animal or 
cell line, (n addition, genetically engineered animals can carry one or more 

1 0 exogenous or non-naturally occurring genes, i.e., "transgenes", that are derived 
from different organisms (e.g., humans), or produced by synthetic or 
recombinant methods. Genetically altered animals or cell lines can be used to 
study Gene 216 function, regulation, and treatments for Gene 216-related 
diseases. In particular, knockout animals and cell lines can be used to 

15 establish animal models and in vitro models for Gene 216-related illnesses, 
respectively. In addition, transgenic animals expressing human Gene 216 can 
be used in drug discovery efforts. 

A "transgenic animal" is any animal containing one or more cells 
bearing genetic information altered or received, directly or indirectly, by 

20 deliberate genetic manipulation at a subcellular level, such as by targeted 
recombination or microinjection or infection with recombinant virus. The term 
"transgenic animal" is not intended to encompass classical cross-breeding or 
in vitro fertilization, but rather is meant to encompass animals in which one or 
more cells are altered by, or receive, a recombinant DNA molecule. This 

25 recombinant DNA molecule may be specifically targeted to a defined genetic 
locus, may be randomly integrated within a chromosome, or it may be 
extrachromosomally replicating DNA. 

Transgenic animals can be selected after treatment of germline cells or 
zygotes. For example, expression of an exogenous Gene 216 gene or a 

30 variant can be achieved by operably linking the gene to a promoter and 
optionally an enhancer, and then microinjecting the construct into a zygote 
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(see, e.g., Hogan et al., Manipulating the Mouse Embryo, A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY). Such 
treatments include insertion of the exogenous gene and disrupted homologous 
genes. Alternatively, the gene(s) of the animals may be disrupted by insertion 
5 or deletion mutation of other genetic alterations using conventional techniques 
(see, e.g., Capecchi, 1989, Science, 244:1288; Valancuis et al., 1991, Mo/. Cell 
Biol., 11:1402; Hasty et al., 1991 , Nature, 350:243; Shinkai et al., 1992, Cell, 
68:855; Mombaerts et al., 1992, Cell, 68:869; Philpott et al., 1992, Science, 
256:1448; Snouwaert etal., 1992, Science, 257:1083; Donehoweret al., 1992, 

10 Nature, 356:215). 

In one aspect of the invention, Gene 216 knockout mice can be 
produced in accordance with well-known methods (see, e.g., M.R. Capecchi, 
1989, Science, 244:1288-1292; P. Li et al., 1995, Cell 80:401 -411; L.A. Galli- 
Taliadoros et al., 1995, J. Immunol. Methods 181(1):1~15; C.H. Westphal et al., 

15 1997, Curr. Biol. 7(7):530-3; S.S. Cheah et al., 2000, Methods Mol. Biol. 
1 36:455-63). The disclosed murine Gene 216 genomic clone can be used to 
prepare a Gene 216 targeting construct that can disrupt Gene 216 in the 
mouse by homologous recombination at the Gene 216 chromosomal locus. 
The targeting construct can comprise a disrupted or deleted Gene 216 

20 sequence that inserts in place of the functioning portion of the native mouse 
gene. For example, the construct can contain an insertion in the Gene 216 
protein-coding region. 

Preferably, the targeting construct contains markers for both positive 
and negative selection. The positive selection marker allows the selective 

25 elimination of cells that lack the marker, while the negative selection marker 
allows the elimination of cells that carry the marker. In particular, the positive 
selectable marker can be an antibiotic resistance gene, such as the neomycin 
resistance gene, which can be placed within the coding sequence of Gene 216 
to render it non-functional, while at the same time rendering the construct 

30 selectable. The herpes simplex virus thymidine kinase (HSV tk) gene is an 
example of a negative selectable marker that can be used as a second marker 
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to eliminate cells that carry it. Cells with the HSV tk gene are selectively killed 
in the presence of gangcyclovir. As an example, a positive selection marker 
can be positioned on a targeting construct within the region of the construct 
that integrates at the Gene 216 locus. The negative selection marker can be 
5 positioned on the targeting construct outside the region that integrates at the 
Gene 216 locus. Thus, if the entire construct is present in the cell, both 
positive and negative selection markers will be present. If the construct has 
integrated into the genome, the positive selection marker will be present, but 
the negative selection marker will be lost. 

10 The targeting construct can be employed, for example, in embryonal 

stem ceil (ES). ES cells may be obtained from pre-implantation embryos 
cultured in vitro (M.J. Evans et a!., 1981 , Nature 292:154-156; M.O. Bradley et 
al., 1984, Nature 309:255-258; Gossler et a(., 1986, Proa Natl. Acad. Sci. USA 
83:9065-9069; Robertson et al., 1986, Nature 322:445-448; S. A. Wood et al., 

15 1993, Proc. Natl. Acad. ScL USA 90:4582-4584). Targeting constructs can be 
efficiently introduced into the ES cells by standard techniques such as DNA 
transfection or by retrovirus-mediated transduction. Following this, the 
transformed ES cells can be combined with blastocysts from a non-human 
animal. The introduced ES cells colonize the embryo and contribute to the 

20 germ line of the resulting chimeric animal (R. Jaenisch, 1988, Science 
240:1468-1474). The use of gene-targeted ES cells in the generation of gene- 
targeted transgenic mice has been previously described (Thomas et al., 1987, 
Cell 51:503-512) and is reviewed elsewhere (Frohman et al., 1989, Cell 
56:145-147; Capecchi, 1989, Trends in Genet 5:70-76; Baribault et al., 1989, 

25 Mol. Biol. Med. 6:481-492; Wagner, 1 990, EMBO J. j>;3025-3032; Bradley et 
al., 1 992, Bio/Technology 10: 534-539). 

Several methods can be used to select homologousiy recombined 
murine ES cells. One method employs PGR to screen pools of transformant 
cells for homologous insertion, followed by screening individual clones (Kim et 

30 al., 1988, Nucleic Acids Res. 16:8887-8903; Kim et al., 1991, Gene 103:227- 
233). Another method employs a marker gene is constructed which will only 
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be active if homologous insertion occurs, allowing these recombinants to be 
selected directly (Sedivy et al. f 1989, Proc. Natl. Acad. Sci. USA 86:227-231). 
For example, the positive-negative selection (PNS) method can be used as 
described above (see, e.g., Mansour et al., 1988, Nature 336:348-352; 
5 Capecchi, 1989, Science 244:1288-1292; Capecchi, 1989, Trends in Genet 
5:70-76). In particular, the PNS method is useful for targeting genes that are 
expressed at low levels. 

The absence of functional Gene 216 in the knockout mice can be 
confirmed, for example, by RNA analysis, protein expression analysis, and 

10 functional studies. For RNA analysis, RNA samples are prepared from 
different organs of the knockout mice and the Gene 216 transcript is detected 
in Northern blots using oligonucleotide probes specific for the transcript. For 
protein expression detection, antibodies that are specific for the. Gene 216 
polypeptide are used, for example, in flow cytometric analysis, 

15 immunohistochemical staining, and activity assays. Alternatively, functional 
assays are performed using preparations of different cell types collected from 
the knockout mice. 

Several approaches can be used to produce transgenic mice. In one 
approach, a targeting vector is integrated into ES cell by homologous 

20 recombination, an intrachromosomal recombination event is used to eliminate 
the selectable markers, and only the transgene is left behind (A.L. Joyner et al., 
1989, Nature 338(621 1):1 53-6; P. Hasty etai., 1991, Nature 350(631 5):243-6; 
V. Valancius and O. Smithies, 1991, Mol. Ceil Biol. 11 (3): 1402-8; S. Fiering et 
al., 1993, Proc. Natl. Acad. Sci. USA 90(18):8469-73). In an alternative 

25 approach, two or more strains are created; one strain contains the gene 
knocked-out by homologous recombination, while one or more strains contain 
transgenes. The knockout strain is crossed with the transgenic strain to 
produce new line of animals in which the original wild-type allele has been 
replaced (although not at the same site) with a transgene. Notably, knockout 

30 and transgenic animals can be produced by commercial facilities (e.g., The 
Lerner Research Institute, Cleveland, OH; B&K Universal, Inc., Fremont, CA; 
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DNX Transgenic Sciences, Cranbury, NJ; Incyte Genomics, Inc., St. Louis, 
MO). 

Transgenic animals (e.g., mice) containing a nucleic acid molecule 
which encodes human Gene 216, may be used as in vivo models to study the 
5 overexpression of Gene 216. Such animals can also be used in drug 
evaluation and discovery efforts to find compounds effective to inhibit or 
modulate the activity of Gene 21 6, such as for example compounds for treating 
respiratory disorders, diseases, or conditions. One having ordinary skill in the 
art can use standard techniques to produce transgenic animals which produce 

10 human Gene 216 polypeptide, and use the animals in drug evaluation and 
discovery projects (see, e.g., U.S. Patent No. 4,873,191 to Wagner; U.S. 
Patent No. 4,736,866 to Leder). 

In another embodiment of the present invention, the transgenic animal 
can comprise a recombinant expression vector in which the nucleotide 

15 sequence that encodes human Gene 216 is operably linked to a tissue specific 
promoter whereby the coding sequence is only expressed in that specific 
tissue. For example, the tissue specific promoter can be a mammary cell 
specific promoter and the recombinant protein so expressed is recovered from 
the animal's milk. 

20 In yet another embodiment of the present invention, a Gene 216 

"knockout" can be produced by administering to the animal antibodies (e.g., 
neutralizing antibodies) that specifically recognize an endogenous Gene 216 
polypeptide. The antibodies can act to disrupt function of the endogenous 
Gene 216 polypeptide, and thereby produce a null phenptype. In one specific 

25 example, an orthologous mouse Gene 216 polypeptide (e.g., SEQ ID NO:366) 
or peptide can be used to generate antibodies. These antibodies can be given 
to a mouse to knockout the function of the mouse Gene 216 ortholog. 

In addition, non-mammalian organisms may be used to study Gene 216 
and Gene 216-related diseases. For example, model organisms such as C. 

30 elegans, D. melanogaster, and S. cerevisiae may be used. Gene 216 
homologues can be identified in these model organisms, and mutated or 
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deleted to produce a Gene 216-deficient strain. Human Gene 216 can then be 
tested for the ability to "complement" the Gene 21 6-deficient strain. Gene 216- 
deficient strains can also be used for drug screening. The study of Gene 216 
homoiogs can facilitate the understanding of human Gene 216 biological 
5 function, and assist in the identification of binding proteins (e.g., agonists and 
antagonists). 
Gene Identification 

To identify genes in the region on 20p13-p12, a set of bacteria! artificial 
chromosome(BAC) clones containing this chromosomal region was identified 

10 in accordance with the methods described herein. The BAC clones served as 
a template for genomic DNA sequencing and served as reagents for identifying 
coding sequences by direct cDNA selection. Genomic sequencing and direct 
cDNA selection methods were used to characterize DNA from 20p13-p12. 

When one or more genes have been genetically localized to a specific 

15 chromosomal region, the gene(s) can be characterized at the molecular level 
by a series of steps that include: 1) cloning the entire region of DNA in a set 
of overlapping clones (physical mapping); 2) characterizing the gene(s) 
encoded by these clones by a combination of direct cDNA selection, exon 
trapping and DNA sequencing (gene identification); and 3) identifying 

20 mutations (i.e., SNPs) in the gene(s) by comparative DNA sequencing of 
affected and unaffected members of the kindred and/or in unrelated affected 
individuals and unrelated unaffected controls (mutation analysis). 

Physical mapping is accomplished by screening libraries of human DNA 
cloned in vectors that are propagated in a host such as E. co//, using 

25 hybridization or PCR assays from unique molecular landmarks in the 
chromosomal region of interest. In accordance with the present invention, a 
physical map of the disorder region was generated by screening a library of 
human DNA cloned in BACs with a set overgo markers that had been 
previously mapped to chromosome 20p13-pl2 by the efforts of the Human 

30 Genome Project. Overgos are unique molecular landmarks in the human 
genome that can be assayed by hybridization. The location of thousands of 
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overgos on the twenty-two autosomes and two sex chromosomes has been 
determined through the efforts of the Human Genome Project. For a positional 
cloning effort, the physical map is tied to the genetic map because the markers 
used for genetic mapping can aiso be used as overgos for physical mapping. 
5 By screening a BAC library with a combination of overgos derived from genetic 
markers, genes, and random DNA fragments, a physical map comprised of 
overlapping clones representing all of the DNA in a chromosomal region of 
interest can be assembled. 

BACs are cloning vectors for large (80 kifobase to 200 kilobase) 

10 segments of human or other DNA that are propagated in E. co/i. To construct 
a physical map using BACs, a library of BAC clones is screened so that 
individual clones harboring the DNA sequence corresponding to a given overgo 
or set of overgos are identified. Throughout most of the human genome, the 
overgo markers are spaced approximately 20 to 50 kilobases apart, so that an 

15 individual BAC clone typically contains at least two overgo markers. In 
addition, the BAC libraries that were screened contain enough cloned DNA to 
cover the human genome twelve times over. An individual overgo typically 
identifies more than one BAC clone. By screening a twelve-fold coverage BAC 
library with a series of overgo markers spaced approximately 50 kilobases 

20 apart, a physical map consisting of a series of overlapping contiguous BAC 
clones, i.e., BAC "contigs," can be assembled for any region of the human 
genome. This map is closely tied to the genetic map because many of the 
overgo markers used to prepare the physical map are also genetic markers. 
When constructing a physical map, it often happens that there are gaps 

25 in the overgo map of the genome that result in the inability to identify BAC 
clones that are overlapping in a given location. Typically, the physical map is 
first constructed from a set of overgos identified through the publicly available 
literature and World Wide Web resources. The initial map consists of several 
separate BAC contigs that are separated by gaps of unknown molecular 

30 distance. To identify BAC clones that fill these gaps, it is necessary to develop 
new overgo markers from the ends of the clones on either side of the gap. 

-99- 



WO 01/78894 



PCT/US01/12245 



This is done by sequencing the terminal 200 to 300 base pairs of the BACs 
flanking the gap, and developing a PCR or hybridization based assay. If the 
terminal sequences are demonstrated to be unique within the human genome, 
then the new overgo can be used to screen the BAC library to identify 
5 additional BACs that contain the DNA from the gap in the physical map. To 
assemble a BAC contig that covers a region the size of the disorder region 
(6,000,000 or more base pairs), it is necessary to develop new overgo markers 
from the ends of a number of clones. 

After building a BAC contig, this set of overlapping clones serves as a 

10 template for identifying the genes encoded in the chromosomal region. Gene 
identification can be accomplished by many methods. Three methods are 
commonly used: 1) a set of BACs selected from the BAC contig to represent 
the entire chromosomal region are sequenced, and computational methods are 
used to identify all of the genes; 2) the BACs from the BAC contig are used as 

15 a reagent to clone cDNAs corresponding to the genes encoded in the region 
by a method termed direct cDNA selection; or 3) the BACs from the BAC contig 
are used to identify coding sequences by selecting for specific DNA sequence 
motifs in a procedure called exon trapping. Gene 216 was identified by 
methods (1) and (2) in accordance with the techniques disclosed herein. 

20 To sequence the entire BAC contig representing the disorder region, a 

set of BACs can be chosen for subcloning into plasmid vectors and subsequent 
DNA sequencing of these subclones. Since the DNA cloned in the BACs 
represents genomic DNA, this sequencing is referred to as genomic 
sequencing to distinguish it from cDNA sequencing. To initiate the genomic 

25 sequencing for a chromosomal region of interest, several non-overlapping BAC 
clones are chosen. DNA for each BAC clone is prepared, and the clones are 
sheared into random small fragments that are subsequently cloned into 
standard plasmid vectors such as pUC18. The plasmid clones are then grown 
to propagate the smaller fragments, and these are the templates for 

30 sequencing. To ensure adequate coverage and sequence quality for the BAC 
DNA sequence/sufficient plasmid clones are sequenced to yield three-fold 
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coverage of the BAC clone. For example, if the BAC is 100 kilobases long, 
then phagemids are sequenced to yield 300 kilobases of sequence. Since the 
BAC DNA is randomly sheared prior to cloning in the phagemid vector, the 300 
kilobases of raw DNA sequence can be assembled by computational methods 
5 into overlapping DNA sequences termed sequence contigs. For the purposes 
of initial gene identification by computational methods, three-fold coverage of 
each BAC is sufficient to yield twenty to forty sequence contigs of 1000 base 
pairs to 20,000 base pairs. 

In accordance with the present invention, the "seed" BACs from the 

10 BAC contig in the disorder region were sequenced. The sequence of the 
"seed" BACs was then used to identify minimally overlapping BACs from the 
contig, and these were subsequently sequenced. In this manner, the entire 
candidate region can be sequenced, with several small sequence gaps left in 
each BAC. This sequence serves as the template for computational gene 

15 identification. In one approach, genes can be identified by comparing the 
sequence of BAC contig to publicly available databases of cDNA and genomic 
sequences, e.g. UniGene, dbEST, EMBL nucleotide database, GenBank, and 
the DNA Database of Japan (DDBJ). The BAC DNA sequence can also be 
translated into protein sequence, and the protein sequence can be used to 

20 search pub/icly available protein databases, e.g., GenPept, EMBL protein 
database, Protein Information Resource (PIR), Protein Data Bank (PDB), and 
SWISS-PROT. These comparisons are typically done using the BLAST family 
of computer algorithms and programs (Altschul et al., 1990, J. Mol. Biol., 
215:403-410; Altschul et al, 1997, Nucl. Acids Res., 25:3389-3402). 

25 For nucleotide queries, BLASTN, BLASTX, and TBLASTX can be used. 

BLASTN compares a nucleotide query sequence with a nucleotide sequence 
database; BLASTX compares a nucleotide query sequence translated in all 
reading frames against a protein sequence database; TBLASTX compares the 
six-frame translations of a nucleotide query sequence against the six-frame 

30 translations of a nucleotide sequence database. For protein queries, BLASTP 
and TBLASTN can be used. BLASTP compares a protein query sequence 



- 101 - 



WO 01/78894 



PCT/US01/12245 



with a protein sequence database; TBLASTN compares a protein query 
sequence against a nucleotide sequence database dynamically translated in 
all reading frames. 

Additionally, computer algorithms such as MZEF (Zhang, 1997, Proc. 
5 Natl. Acad. Set. USA 94:565-568), GRAIL (Uberbacher et al., 1996, Methods 
Enzymol., 266:259-281), and Genscan (Burge and Karlin, 1997, J. Mol. Biol., 
268:78-94) can be used to predict the location of exons in the sequence based 
on the presence of specific DNA sequence motifs that are common to all 
exons, as well as the presence of codon usage typical of human protein 

10 encoding sequences. 

In addition to identifying genes by computational methods, genes can 
be identified by direct cDNA selection (Del Mastro and Lovett, 1996, Methods 
in Molecular Biology, Humana Press Inc., NJ). In direct cDNA selection, cDNA 
pools from tissues of interest are prepared, and BACs from the candidate 

15 region are used in a liquid hybridization assay to capture the cDNAs which 
base pair to coding regions in the BAC. In the methods described herein, the 
cDNA pools were created from several different tissues by random priming and 
oligo dT priming the first strand cDNA from poly A + RNA, synthesizing the 
second -strand cDNA by standard methods, and adding linkers to the ends of 

20 the cDNA fragments. In this approach, the linkers are used to amplify the 
cDNA pools of BAC clones from the disorder region identified by screening a 
BAC library. The amplified products are then used as a template for initiating 
DNA synthesis to create a biotin labeled copy of BAC DNA. Following this, the 
biotin labeled copy of the BAC DNA is denatured and incubated with an excess 

25 of the PCR amplified, linkered cDNA pools which have also been denatured. 
The BAC DNA and cDNA are allowed to anneal in solution, and 
heteroduplexes between the BAC and the cDNA are isolated using streptavidin 
coated magnetic beads. The cDNAs that are captured by the BAC are then 
amplified using primers complimentary to the linker sequences, and the 

30 hybridization/selection process is repeated for a second round. After two 
rounds of direct cDNA selection, the cDNA fragments are cloned, and a library 
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of these direct selected fragments is created. 

The cDNA clones isolated by direct selection are analyzed by two 
methods. Where the genomic target DNA sequence is obtained from a pool 
of BACs from the disorder region, the cDNAs are mapped to BAG genomic 
5 clones to verify their chromosomal location. This is accomplished by arraying 
the cDNAs in microtiter dishes, and replicating their DNA in high-density grids. 
Individual genomic clones known to map to the region are then hybridized to 
the grid to identify direct selected cDNAs mapping to that region. cDNA clones 
that are confirmed to correspond to individual BACs are sequenced. To 

10 determine whether the cDNA clones isolated by direct selection share 
sequence identity or similarity to previously identified genes, the DNA and 
protein coding sequences are compared to publicly available databases using 
the BLAST family of programs described above. 

The combination of genomic DNA sequence and cDNA sequence 

15 provided by BAC sequencing and by direct cDNA selection yields an initial list 
of putative genes in the region. In the present invention, the genes in the 
region were candidates for the asthma locus. To further characterize each 
gene, Northern blots were performed to determine the size of the transcript 
corresponding to each gene, and to determine which putative exons were 

20 transcribed together to make an individual gene. For Northern blot analysis of 
each gene, probes are prepared from direct selected cDNA clones or by PGR 
amplifying specific fragments from genomic DNA, cDNA or from the BAC 
encoding the putative gene of interest. The Northern blot analysis is used to 
determine the size of the transcript and the tissues in which it is expressed. 

25 For transcripts that are not highly expressed, it is sometimes necessary to 
perform a reverse transcription PCR assay using RNA from the tissues of 
interest as a template for the reaction. 

Gene identification by computational methods and by direct cDNA 
selection provides unique information about the genes in a region of a 

30 chromosome. Once genes are identified, it is possible to examine subjects for 
sequence variants. Variant sequences can be inherited as allelic differences 
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or can arise from spontaneous mutations. 

Inherited alleles can be analyzed for linkage to a disease susceptibility 
locus. Linkage analysis is possible because of the nature of inheritance of 
chromosomes from parents to offspring. During meiosis, the two parental 
5 homologs pair to guide their proper separation to daughter cells. While they 
are paired, the two homologs exchange pieces of the chromosomes, in an 
event called "crossing over" or "recombination." The resulting chromosomes 
contain parts that originate from both parental homologs. The closer together 
two sequences are on the chromosome, the less likely that a recombination 

1 0 event will occur between them, and the more closely linked they are. 

In the present invention, data obtained from the different families were 
combined and analyzed together by a computer using statistical methods 
described herein. The results were then used as evidence for linkage between 
the genetic markers used and an asthma susceptibility locus. 

15 In general, a recombination frequency of 1% is equivalent to 

approximately 1 map unit, a relationship that holds up to frequencies of about 
20% or 20 cM. One centimorgan (cM) is roughly equivalent to 1,000 kb of 
DNA. The entire human genome is 3,300 cM long. In order to find an 
unknown disease gene within 5-10 cM of a marker locus, the whole human 

20 genome can be searched with roughly 330 informative marker loci spaced at 
approximately 10 cM intervals (Botstein et al., 1980, Am. J. Hum. Genet, 
32:314-331). 

The reliability of linkage results is established by using a number of 
statistical methods. The methods most commonly used for the detection by 

25 linkage analysis of oligogenes involved in the etiology of a complex trait are 
non-parametric or model-free methods which have been implemented into the 
computer programs MAPMAKER/SIBS (L. Kruglyak and E.S. Lander, 1995, 
Am. J. Hum. Genet 57:439-454) and GENEHUNTER (L. Kruglyak et al., 1996, 
Am. J. Hum. Genet 58:1347-1363). Typically, linkage analysis is performed 

30 by typing members of families with multiple affected individuals at a given 
marker locus and evaluating if the affected members (excluding parent- 
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offspring pairs) share alleles at the marker locus that are identical by descent 
(IBD) more often than expected by chance alone. 

As a result of the rapid advances in mapping the human genome over 
the last few years, and concomitant improvements in computer methodology, 
5 it has become feasible to carry out linkage analyses using multi-point data. 
Multi-point analysis provides a simultaneous analysis of linkage between the 
trait and several linked genetic markers, when the recombination distance 
among the markers is known. A LOD score statistic is computed at multiple 
locations along a chromosome to measure the evidence that a susceptibility 

10 locus is located nearby. A LOD score Is the logarithm base 10 of the ratio of 
the likelihood that a susceptibility locus exists at a given location to the 
likelihood that no susceptibility locus is located there. By convention, when 
testing a single marker, a total LOD score greater than +3.0 (that is, odds of 
linkage being 1,000 times greater than odds of no linkage) is considered to be 

1 5 significant evidence for linkage. 

Multi-point analysis is advantageous for two reasons. First, the 
informativeness of the pedigrees is usually increased. Each pedigree has a 
certain amount of potential information, dependent on the number of parents 
heterozygous for the marker loci and the number of affected individuals in the 

20 family. However, few markers are sufficiently polymorphic as to be informative 
in all those individuals. If multiple markers are considered simultaneously, then 
the probability of an individual being heterozygous for at least one of the 
markers is greatly increased. Second, an indication of the position of the 
disease gene among the markers may be determined. This allows identification 

25 of flanking markers, and thus eventually allows identification of a small region 
in which the disease gene resides. 
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EXAMPLES 

The examples as set forth herein are meant to exemplify the various 
aspects of the present invention and are not intended to limit the invention in 
any way. 

5 EXAMPLE 1: Family Collection 

Asthma is a complex disorder that is influenced by a variety of factors, 
including both genetic and environmental effects. Complex disorders are 1 
typically caused by multiple interacting genes, some contributing to disease 
development and some conferring a protective effect. The success of linkage 

10 analyses in identifying chromosomes with significant LOD scores is achieved 
in part as a result of an experimental design tailored to the detection of 
susceptibility genes in complex diseases, even in the presence of epistasis and 
genetic heterogeneity. Also important are rigorous efforts in ascertaining 
asthmatic families that meet strict guidelines, and collecting accurate clinical 

15 information. 

Given the complex nature of the asthma phenotype, non-parametric 
affected sib pair analyses were used to analyze the genetic data. This 
approach does not require parameter specifications such as mode of 
inheritance, disease allele frequency, penetrance of the disorder, or phenocopy 

20 rates. Instead, it determines whether the inheritance pattern of a chromosomal 
region is consistent with random segregation. If it is not, affected sibs inherit 
identical copies of alleles more often than expected by chance. Because no 
models for inheritance are assumed, ailele-sharing methods tend to be more 
robust than parametric methods when analyzing complex disorders. They do, 

25 however, require larger sample sizes to reach statistically significant results. 

At the outset of the program, the goal was to collect 400 affected sib- 
pair families for the linkage analyses. Based on a genome scan with markers 
spaced -10 cM apart, this number of families was predicted to provide > 95% 
power to detect an asthma susceptibility gene that caused an increased risk 

30 to first-degree relatives of 3-fold or greater. The assumed relative risk of 3-fold 
was consistent with epidemiological studies in the literature that suggest an 
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increased risk ranging from 3- to 7-fold. The relative risk was based on 
gender, different classifications of the asthma phenotype (i.e. bronchial hyper- 
responsiveness versus physician's diagnosis) and, in the case of offspring, 
whether one or both parents were asthmatic. 
5 The family collection efforts exceeded the initial goal of 400, obtaining 

a total of 444 affected sibling pair (ASP) families, with 342 families from the UK 
and 102 families from the US. The ASP families in the US collection were 
Caucasian with a minimum of two affected siblings that were identified through 
both private practice and community physicians as well as through advertising. 

10 A total of 102 families were collected in Kansas, Nebraska, and Southern 
California. In the UK collection, Caucasian families with a minimum of two 
affected siblings were identified through physicians' registers in a region 
surrounding Southampton and including the Isle of Wight. In both the US and 
UK collections, additional affected and unaffected sibs were collected 

15 whenever possible. An additional 39 families from the United Kingdom were 
utilized from an earlier collection effort with different ascertainment criteria. 
These families were recruited either: 1) without reference to asthma and 
atopy; or 2) by having at least one family member or at least two family 
members affected with asthma. The randomly ascertained samples were 

20 identified from general practitioner registers in the Southampton area. For 
families with affected members, the probands were recruited from hospital 
based clinics in Southampton. Seven pedigrees extended beyond a single 
nuclear family. 

Families were included in the study if they met all of the following 
25 criteria: 1) the biological mother and biological father were Caucasian and 
agreed to participate in the study; 2) at least two biological siblings were alive, 
each with a current physician diagnosis of asthma, and were 5 to 21 years of 
age; and 3) the two siblings were currently taking asthma medications on a 
regular basis. This included regular, intermittent use of inhaled or oral 
30 bronchodilators and regular use of cromolyn, theophylline, or steroids. 

Families were excluded from the study if they met any one of the 
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following criteria: 1) both parents were affected (i.e., with a current diagnosis 
of asthma, having asthma symptoms, or on asthma medications at the time of 
the study); 2) any of the siblings to be included in the study was less than 5 
years of age; 3) any asthmatic family member to be included in the study was 
5 taking beta-blockers at the time of the study, 4) any family member to be 
included in the study had congenitalor acquired pulmonary disease at birth 
(e.g. cystic fibrosis), a history of serious cardiac disease (myocardial infarction) 
or any history of serious pulmonary disease (e.g. emphysema); or 5) any family 
member to be included in the study was pregnant. 

10 An extensive clinical instrument was designed and data from all 

participating family members were collected. The case report form (CRF) 
included questions on demographics, medical Wstory including medications, 
a health survey on the incidence and frequency of asthma, wheeze, eczema, 
hay fever, nasal problems, smoking, and questions on home environment. 

1 5 Data from a video questionnaire designed to show various examples of wheeze 
and asthmatic attacks were also included in the CRF. Clinical data, including 
skin prick tests to 8 common allergens, total and specific IgE levels, and 
bronchial hyper-responsiveness following a methacholine challenge, were also 
collected from all participating family members. All data were entered into a 

20 SAS dataset by IMTCl, a CRO; either by double data entry or scanning 
followed by on-screen visual validation. An extensive automated review of the 
data was performed on a routine basis and a full audit at the conclusion of the 
data entry was completed to verify the accuracy of the dataset. 
EXAMPLE 2: Genome Scan 

25 In order to identify chromosomal regions linked to asthma, the 

inheritance pattern of alleles from genetic markers spanning the genome was 
assessed on the collected family resources. As described above, combining 
these results with the segregation of the asthma phenotype in these families 
allows the identification of genetic markers that are tightly linked to asthma. In 

30 turn, this provides an indication of the location of genes predisposing affected 
individuals to asthma. The genotyping strategy was twofold: 1 ) to conduct a 
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genome wide scan using markers spaced at approximately 10 cM intervals; 
and 2) to target ten chromosomal regions for high density genetic mapping. 
The initial candidate regions for high-density mapping were chosen based on 
suggestions of linkage to these regions by other investigators. 
5 Genotypes of PCR amplified simple sequence microsatellite genetic 

linkage markers were determined using ABI model 377 Automated Sequencers 
(PE Applied Biosystems). Microsatellite markers were obtained from Research 
Genetics Inc. (Huntsville, AL) in the fluorescent dye-conjugated form (see 
Dubovsky et al., 1995, Hum. MoL Genet 4(3):449-452). The markers 

10 comprised a variation of a human linkage mapping pane! as released from the 
Cooperative Human Linkage Center (CHLC), also known as the Weber lab 
screening set version 8. The variation of the Weber 8 screening set consisted 
of 529 markers with an average spacing of 6.9 cM (autosomes only) and 7.0 
cM (all chromosomes). Eighty-nine percent of the markers consisted of either 

15 tri- or tetra-nucleotide microsatellites. There were no gaps present in 
chromosomal coverage greater than 17.5 cM. 

Study subject genomic DNA (5 jaI; 4.5 ng/\i\) was amplified in a 10 \l\ 
PCR reaction using AmpliTaq Gold DNA polymerase (0.225 U); 1 X PCR buffer 
(80 mM (NH 4 ) 2 S0 4 ; 30 mM Tris-HCI (pH 8.8); 0.5% Tween-20); 200 jiM each 

20 dATP, dCTP, dGTP and dTTP; 1.5-3.5 |xM MgCI 2 ; and 250 jJvl forward and 
reverse PCR primers. PCR reactions were set up in 192 well plates (Costar) 
using a Tecan Genesis 150 robotic workstation equipped with a refrigerated 
deck. PCR reactions were overlaid with 20 \x\ mineral oil, and thermocycled on 
an MJ Research Tetrad DNA Engine equipped with four 192 well heads using 

25 the following conditions: 92°C for 3 min; 6 cycles of 92°C for 30 sec, 56°C for 
1 min, 72°C for 45 sec; followed by 20 cycles of 92°C for 30 sec, 55°C for 1 
min, 72°C for 45 sec; and a 6 min incubation at 72°C. 

PCR products of 8-12 microsatellite markers were subsequently pooled 
into two 96-well microtitre plates (2.0 ^l PCR product from TET and FAM 

30 labeled markers, 3.0 \x\ HEX labeled markers) using a Tecan Genesis 200 
robotic workstation and brought to a final volume of 25 uJ with H 2 0. Following 
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this, 1 .9 jllI of pooled PCR product was transferred to a loading plate and 
combined with 3.0 jjlI loading buffer (2.5 ul formamide/blue dextran (9.0 mg/ml), 
0.5 jo.1 GS-500 TAMRA labeled size standard, ABI). Samples were denatured 
in the loading plate for 4 min at 95°C, placed on ice for 2 min, and 
5 electrophoresed on a 5% denaturing polyacrylamide gel (FMC on the ABI 
377XL). Samples (0.8 ul) were loaded onto the gel using an 8 channel 
Hamilton Syringe pipettor. 

Each gel consisted of 62 study subjects and 2 control subjects (CEPH 
parents ID #1331-01 and 1331-02, Coriell Cell Repository, Camden, NJ). 
10 Genotyping gels were scored in duplicate by investigators blind to patient 
identity and affection status using GENOTYPER analysis software V 1.1.12 
(ABI; PE Applied Biosystems). Nuclear families were loaded onto the gel with 
the parents flanking the siblings to facilitate error detection. The final tables 
obtained from the GENOTYPER output for each gel analysed were imported 
1 5 into a SYBASE Database. 

Allele calling (binning) was performed using the SYBASE version of the 
ABAS software (Ghosh et al., 1997, Genome Research 7:165-178). Offsize 
bins were checked manually and incorrect calls were corrected or blanked. 
The binned alleles were then imported into the program MENDEL (Lange et al., 
20 1988, Genetic Epidemiology, 5:471) for inheritance checking using the 
USERM13 subroutine (Boehnke et al., 1991, Am. J. Hum. Genet. 48:22-25). 
Non-inheritance was investigated by examining the genotyping traces and, 
once a/I discrepancies were resolved, the subroutine USERM13 was used to 
estimate allele frequencies. 
25 EXAMPLE 3: Linkage Analysis 

Chromosomal regions harboring asthma susceptibility genes by linkage 
analysis of genotyping data and three separate phenotypes, asthma, bronchial 
hyper-responsiveness, and atopic status were identified as follows. 

1. Asthma Phenotvpe : For the initial linkage analysis, the 
30 phenotype and asthma affection status were defined by a patient who 
answered the following questions in the affirmative: i) have you ever had 
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asthma; ii) do you have a current physician's diagnosis of asthma; and iii) are 
you currently taking asthma medications? Medications included inhaled or oral 
bronchodilators, cromolyn, theophylline, or steroids. Multipoint linkage 
analyses of allele sharing In affected Individuals were performed using the 
5 MAPMAKER/SIBS analysis program (L. Krugtyak and E.S. Lander, 1995, Am. 
J. Hum. Genet 57:439-454). The map location and distances between 
markers were obtained from the genetic maps published by the Marshfield 
medical research foundation (http://www.marshmed.org/genetics/). Ambiguous 
ordering of markers in the Marshfield map was resolved using the program 

10 MULTIMAP (T.C. Matise et al., 1994, Nature Genet. 6:384-390)/ 

Using the discrete phenotype of asthma (yes/no), a candidate region 
was identified on chromosome 20 with a LOD score of 2.94, based on 462 
nuclear families. Figure 1 displays the multipoint LOD score against the map 
location of the markers along chromosome 20. A Maximum LOD Score (MLS) 

1 5 of 2.94 was obtained at location 7.9 cM, 0.3 cM proximal to marker D20S906. 
A second MLS of 2.94 was obtained at marker D20S482 at location 12.1 cM. 
An excess sharing by descent (Identity By Descent (lBD) = 2) of 0.31 was 
observed at both maximum LOD scores. Table 2 lists the single and multipoint 
LOD scores at each marker. Analyses were done using a conservative 

20 approach by weighting multiple sibling pairs within a sibship. When affected 
sib pairs were utilized in the linkage analyses without weighting the LOD score 
on chromosome 20 maximized at D20S482 with a value of 3.19. Thus, these 
data provided strong evidence for the presence of an asthma susceptibility 
gene in this region of chromosome 20. 

25 TABLE 2 



Marker 


Distance 


Single-point 


Multipoint 


D20S502 


0.5 


0.7 


2.4 


D20S103 


2.1 


2.4 


2.3 


D20S117 


2.8 


1.2 


2.0 


GTC4ATG 


6.3 


2.4 


2.5 


GTC3CA 


6.6 


1.3 


2.7 


D20S906 


7.6 


2.9 


2.9 


D20S842 


9.0 


1.3 


2.5 


D20S181 


9.5 


1.8 


2.6 


D20S193 


9.5 


2.5 


2.5 
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D20S889 


11 .2 


1.6 


2.6 


D20S482 


12.1 


1 .9 


2.9 


D20S849 


14.0 


0.8 


2.0 


D20S835 


15.1 


0.5 


1.8 


D20S448 


18.8 


1.4 


1 A 


D20S602 


21.2 


1.1 


1.1 


D20S851 


24.7 


1.0 


0.8 


D20S604 


32.9 


0.0 


0.1 


D20S470 


39.3 


0.0 


0.1 


D20S477 


47.5 


0.0 


0.0 


D20S478 


54.1 


0.0 


0.0 


D20S481 


62.3 


0.0 


0.0 


D20S480 


79.9 


0.0 


0.0 


D20S171 


95.7 


0.4 


0.1 



2. Phenotypic Subgroups : Nuclear families were ascertained by the 
presence of at least two affected siblings with a current physician's diagnosis 
of asthma, as well as the use of asthma medication. In the initial analysis (see 
5 above), the evidence was examined for linkage based on that dichotomous 
phenotype (asthma - yes/no). To further characterize the linkage signals, 
additional quantitative traits were measured in the clinical protocol. Since 
quantitative trait loci (QTL) analysis tools with correction for ascertainment was 
not available, the following approach was taken to refine the linkage and 
10 association analyses: 

i. Phenotypic subgroups that could be indicative of an 
underlying genotypic heterogeneity were identified. Asthma subgroups were 
defined according to 1) bronchial hyper-responsiveness (BHR) to methacholine 
challenge; or 2) to atopic status using quantitative measures like total serum 

15 IgE and specific IgE to common allergens. 

ii. Non-parametric linkage analyses were performed on 
subgroups to test for the presence of a more homogeneous sub-sample. If 
genetic heterogeneity was present in the sample, the amount of allele sharing 
among phenotypically similar siblings was expected to increase in the 

20 appropriate subgroup in comparison to the full sample. A narrower region of 
significant increased allele sharing was also expected to result unless the 
overall LOD score decreased as a consequence of having a smaller sample 
size and of using an approximate partitioning of the data. 

iii. Alternatively, allele sharing probabilities were 

- 112- 



WO 01/78894 



PCT/US01/12245 



parameterized as a function of the quantitative trait value of each child in a 
given sib pair, as advocated by N. Morton and implemented in his program 
BETA (N. Morton, 1996, Proc. Natl. Acad. Scl. USA 93:3471-3476). This 
approach alleviated the need to dichotomize a quantitative trait. However, the 
5 program did not correct for the use of non-independent sib pairs in sibship of 
size 3 or larger. As such it did not provide an accurate measure of the 
significance of a linkage finding, but was used to corroborate the localization 
of the linkage signal. 

3. Results for BHR and IgE : PC 2 o, the concentration of 

10 methacholine resulting in a 20% drop in FEV<| (forced expiratory volume), was 
polychotomized in four groups and analyses were performed on the subsets 
of asthmatic children with mild to severe BHR (PC 20 ^ 4 mg/ml) or PC 2 o(4), as 
well as on the broader subset with borderline to severe BHR (PC 20 < 16 mg/m!) 
or PC 20 (16). As shown in the LOD plot in Figure 2, the MLS for the subset of 

15 127 nuclear families with at least two PC 20 (4) affected sibs was 2.97 at 1 1 .8 
cM, 0.3 cM from D20S482, with an excess sharing by descent of 0.37. As 
shown in Figure 3, for the 218 nuclear families with at least two PC 2 o(16), the 
MLS was 3.93 at D20S482 with an excess sharing of 0.36. Both PC 20 (4) and 
PC 2 o(16) strongly implicated the region of chromosome 20 under the second 

20 peak around marker D20S482. When considering the more extreme 
phenotype, PC 20 (4), a higher proportion of families was linked to the region. 
However, the increase in LOD score for the PC 20 (16) phenotype indicated that 
families concordant for the milder BHR phenotype also contributed to the 
linkage signal and would provide a larger pool of linked families. 

25 Total IgE was dichotomized using an age specific cutoff for elevated 

levels (one standard deviation above the mean). Similarly, a dichotomous 
variable was created using specific IgE to common allergens. An individual 
was assigned a high specific IgE value if his/her level was positive (grass or 
tree) or elevated (> 0.35 KU/L for cat, dog, mite A, mite B, alternaria, or 

30 ragweed) for at least one such measure. In linkage analyses, the subset of 
asthmatic children with high total IgE (274 families) was given a maximum LOD 
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score of 2.3 at 1 1 .6 cM (Figure 4), while the subset with high specific IgE (288 
families) was given a LOD score of 1.87 at 12.1 cM (Figure 5). Similar to the 
BHR results, analyses based on IgE implicated the region under the second 
peak around marker D20S482 The substantially lower LOD scores using the 
5 subset of affected sibs concordant for atopy indicated the presence of groups 
with fewer linked families. Thus, atopy in asthmatic individuals was not the 
primary phenotype associated with the linkage signal on chromosome 20. 

The BETA program (Morton, 1996) was used on two scales for PC20. 
Individuals that did not drop 20% by the last dose administered (16 mg/ml) 

10 were assigned an arbitrary value of 32 mg/ml. First, a (0,1)-severity scale was 
constructed by applying a linear transformation to PC 2 o where 0 mg/m! 
received a score of 1 and 32 mg/ml received a score of 0. For this scale, 
individuals that did not drop 20% in their FEV1 did not contribute to the LOD 
score. A maximum LOD score of 3.43 was achieved at 12.1 cM with marker 

15 D20S482. Second, a linear transformation of PC 2 o was used where 0 mg/ml 
received a score of 1 and 32 mg/ml a score of -1 . In other words, in addition 
to the high concordant pairs, discordant pairs and concordant pairs that did not 
drop would also contribute to the LOD score. In contrast, individuals with PC 2 o 
close to 16 mg/ml would have little impact on the LOD score. A maximum LOD 

20 score of 2.08 was again achieved at 12.1 cM. 

Accordingly, a consistent pattern of evidence by linkage analysis pointed 
to the existence of an asthma susceptibility locus in the vicinity of marker 
D20S482. This was supported by the initial analysis of the asthma (yes/no) 
phenotype and by analyses of BHR in asthmatic individuals. Localization in the 

25 region of marker D20S482 was obtained using both BHR and IgE phenotypes. 
EXAMPLE 4: Physical Mapping 

The linkage results for chromosome 20 described above were used to 
delineate a candidate region for a disorder-associated gene located on 
chromosome 20. Gene discovery efforts were thus initiated in a 25 cM interval 

30 from the 20p telomere (marker D20S502) to marker D20S851 , representing a 
>98% confidence interval. All genes known to map to this interval were 
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considered as candidates. intensive physical mapping (BAC contig 
construction) focused on a 90% confidence interval between markers D20S103 
and D20S916, a 15 cM interval. The discovery of novel genes using direct* 
cDNA selection focused on a 95% confidence interval between markers 
5 D20S502 <20p telomere) and D20S916, a 17 cM region. 

The following section describes details of the efforts to generate cloned 
coverage of the disorder gene region on chromosome 20, i.e., construction of 
a BAC contig spanning the region. There were two primary reasons for using 
this approach: 1) to provide genomic clones for DNA sequencing (analysis of 

1 0 this sequence would provide information about the gene content of the region); 
and 2) to provide reagents for direct cDNA selection (this would provide 
additional information about novel genes mapping to the interval). The 
physical map consisted of an ordered set of molecular landmarks, and a set 
of bacterial artificial chromosome clones (BACs; U.-J. Kim et al., 1996, 

15 Genomics 34:213-218; H. Shizuya et al., 1992, Proc. NatL Acad. Sci. USA 
89:8794-8797) that contained the disorder gene region from human 
chromosome 20p13-p12. 

Figure 6 depicts the BAC/STS content contig map of human 
chromosome 20p13-p12. Markers used to screen the RPC1-1 1 BAC library (P. 

20 deJong, Roswell Park Cancer Institute (RPCI)) are shown in the top row. 
Markers that were present in the Genome Database (GDB, 
http://gdbwww.gdb.org/) are represented by GDB nomenclature. The BAC 
clones are shown below the markers as horizontal lines. BAC RPCI- 
11_1098L22 is labeled and the location of Gene 216, described herein, is 

25 indicated at the top of the figure. 

1. Map Integration . Various publicly available mapping resources 
were utilized to identify existing STS (sequence tagged site) markers (Olson 
et al., 1989, Science, 245:1434-1435) in the 20p13-p12 region. Resources 
included the GDB (http://gdbwww.gdb.org/), Genethon (http://www.genethon. 

30 fr/genethon_en.html), Marshfield Center for Medical Genetics 
(http://www.marshmed.org/genetics/), the Whitehead Institute Genome Center 
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(http://www-genome.wLmit.edu/), GeneMap98, dbSTS and dbEST (NCBI, 
http://www.ncbi.nlm.nih.gov/), the Sanger Centre (http://www.sanger.ac.uk/), 
and the Stanford Human Genome Center (http://www-shgc.stanford.edu/). 
Maps were integrated manually to identify markers mapping to the disorder 
5 region. A list of the markers is provided in Table 3. 

2. Marker Development : Sequences for existing STSs were 
obtained from the GDB, RHDB (http://www.ebi.ac.uk/RHdb/), or NCBI, and 
were used to pick primer pairs (overgos; see Table 3) for BAC library 
screening. Novel markers were developed either from publicly available 

1 0 genomic sequences, proprietary cDNA sequences, or from sequences derived 
from BAC insert ends (described below). Primers were chosen using a script 
that automatically performs vector and repetitive sequence masking using 
CROSSMATCH (P. Green, University of Washington). Subsequent primer 
selection was performed using a customized File maker Pro database 

1 5 (http://www.filemaker.com). Primers for use in PCR-based clone confirmation 
or radiation hybrid mapping (described below) were chosen using the program 
Primer3 (Steve Rozen, Helen J. Skaletsky, 1996, 1997, http://www- 
genome.wi.mit.edu/genome_software/other /p rimer3.html). 

Table 3 



Overgo 


Locus 


DNA 
Type 


Gene 


Forward Primer 


SEQ 

ID NO 


Reverse Primer 


SEQ 
ID NO 


stSG24277 




Genomic 






34 


aaccaccacggattcacgcttc 


45 


stSG408 




EST 




aatatcatgcaccatgacccac 


35 




46 


A005O05 




EST 


Attractin (ATTN) 


tggagtaagtattgtaaactat 


36 


atccccgcaatgaaatagttta 


47 


B849D17AL 




BACend 






37 


gttgagagcccacttagataat 


48 


SN2 




EST 


Sialoadhesln {SN) 


agagccacacatccatgtcctg 


38 




49 


AFMb026xh5 


D20S867 


MSAT 






39 


gccactaggaggcaatggcaat 


50 


SN1 




EST 


Siaioadhesin (SN) 


gagtagtcglagtaccagatgg 


40 




51 


stsH22126 




EST 




glctggcaatggagcatgaaaa 


41 




52 


W!4876 


D20S752 


Genomic 






42 


iga catcaa cttctcctttcct 


53 


S1SG3D448 




EST 






43 




54 


WI18677 




EST 




44 


tctgggagaggacggagctggc 


55 



3. Radiation Hybrid (RhO Mapping : Radiation hybrid mapping was 

performed against the Genebridge4 panel (Gyapay et a!., 1996, Hum. Mot. 

Genet 5:339-46) purchased from Research Genetics, in order to refine the 

chromosomal localization of genetic markers used in genotyping as well as to 
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identify, confirm, and refine localizations of markers from proprietary 
sequences. Standard PCR procedures were used for typing the RH panel with 
markers of interest. Briefly, 1 0 ul PCR reactions contained 25 ng DNA of each 
of the 93 Genebridge4 RH samples. PCR products were electrophoresed on 
5 2% agarose gels (Sigma) containing 0.5 ug/ml ethidium bromide in 1 X TBE at 
150 volts for 45 min. Model A3-1 electrophoresis systems were used (Owl 
Scientific Products, Portsmouth, NH). Typically, gels contained 10 tiers of 
lanes with 50 wells/tier. Molecular weight markers (100 bp ladder, GibcoBRL, 
Rockville, MD) were loaded at both ends of the gel. fmages of the gels were 

10 captured with a Kodak DC40 CCD camera and processed with Kodak 1D 
software (www.kodak.com). The gel data were exported as tab delimited text 
files; names of the files included information about the panel screened, the gel 
image files and the marker screened. These data were automatically imported 
using a customized Perl script into Filemaker databases for data storage and 

15 analysis. The data were then automatically formatted and submitted to an 
internal server for linkage analysis to create a radiation hybrid map using 
RHMAPPER (L. Stein et al., 1995; available from Whitehead Institute/MIT 
Center for Genome Research, at http://www.genome.wi.mit.edu 
/ftp/pub/software/rhmapper/, and via anonymous ftp to ftp.genome.wi.mit.edu, 

20 in the directory /pub/software/rh mapper.) 

4. BAC Library Screening : The protocol used for BAC library 
screening was based on the "overgo" method, originally developed by John 
McPherson at Washington University in St. Louis (http://www.tree.caltech.edu 
/protocols/overgo. html, and W-W. Cai et al., 1998, Genomics 54:387-397). 

25 This method involved filling in the overhangs generated after annealing two 
primers, each 22 nucleotides in length, which overlap by 8 nucleotides. The 
resulting labeled 36 bp product was then used in hybridization-based screening 
of high density grids derived from the RPC1-1 1 BAC library (deJong, supra). 
Typically, 15 probes were pooled together to hybridize 12 filters (13.5 genome 

30 equivalents). 

Stock solutions (2 uM) of combined complementary oligos were heated 
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at 80°C for 5 min, placed at 37°C for 10 min, and then stored on ice. Labeling 
reactions included the following: 1 .0 ul H 2 0; 5 ul mixed oligos (2 uM each); 0.5 
ul BSA (2 mg/ml); 2 ul OLB (-A, - C, -N6) Solution (see below); 0.5 ul 32 P-dATP 
(3000 Ci/mmol); 0.5 ul 32 P-dCTP (3000 Ci/mmol); and 0.5 ul Klenow fragment 
5 (5 U/ul). The reaction was incubated at room temperature for 1 hr, and 
unincorporated nucleotides were removed using Sephadex G50 spin columns. 
Solution O: 1.25 M Tris-HCL, pH 8, 125 M MgCI 2 ; Solution A: 1 ml Solution 
0, 18 pi 2-mercaptoethanol, 5ul 0.1M dTTP, 5ul 0.1 M dGTP; Solution B: 2 M 
HEPES-NaOH, pH 6.6; Solution C: 3 mM Tris-HCI, pH 7.4, 0.2 mM EDTA; 

1 0 Solutions A, B, and C were combined to a final ratio of 1 :2.5:1 .5, and aliquots 
were stored at -20°C. 

High-density BAC library membranes were pre-wetted in 2 X SSC at 
58°C. Filters were then drained slightly and placed in hybridization solution 
(1% BSA; 1 mM EDTA, pH 8.0; 7% SDS; and 0.5 M sodium phosphate), pre- 

15 warmed to 58°C, and incubated at 58°C for 2-4 hr. Typically, 6 filters were 
hybridized in each container. Ten milliliters of pre-hybridization solution was 
removed, combined with the denatured overgo probes, and added back to the 
filters. Hybridization was performed overnight at 58°C. The hybridization 
solution was removed and filters were washed once in 2 X SSC, 0.1% SDS, 

20 followed by a 30 min wash in the same solution at 58°C. Filters were then 
washed in: 1 ) 1 .5 X SSC and 0.1 % SDS at 58°C for 30 min; 2) 0.5 X SSC and 
0.1% SDS at 58°C for 30 min; and finally in 3) 0.1 X SSC and 0.1% SDS at 
58°C for 30 min. Filters were then wrapped in Saran Wrap and exposed to film 
overnight. To remove bound probe, filters were treated in 0.1 X SSC and 0.1% 

25 SDS pre-warmed to 95°C and cooled room temperature. Clone addresses 
were determined as described by instructions supplied by RPCI. 

To recover clonal BAC cultures from the library, a sample from the 
appropriate library well was plated by streaking onto LB agar (T. Maniatis et al., 
1982, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 

30 Cold Spring Harbor, NY) containing 12.5 ug/mi chloramphenicol (Sigma),-and 
plates were incubated overnight. A single colony and a portion of the initial 
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streak quadrant were inoculated into 400 ul LB plus chloramphenicol in wells 
of a 96 well plate. Cultures were grown overnight at 37°C. For storage, 100 
ul of 80% glycerol was added and the plates placed at -80°C. 

To determine the marker content of clones, aliquots of the 96 well plate 
5 cultures were transferred to the surface of nylon filters (GeneScreen Plus, 
NEN) placed on LB/chloramphenicol Petri , plates. Colonies were grown 
overnight at 37°C and colony lysis was performed by placing filters on pools of: 
1 ) 1 0% SDS for 3 min; 2) 0.5 N NaOH and 1 .5 M NaCI for 5 min; and 3) 0.5 
M Tris-HCl, pH 7.5, and 1 M NaCI for 5 min. Filters were then air-dried and 

10 washed free of debris in 2 X SSC for 1 hr. The filters were air-dried for at least 
1 hr and DNA was crosslinked linked to the membrane using standard 
conditions. Probe hybridization and filter washing were performed as 
described above for the primary library screening. Confirmed clones were 
stored in LB containing 15% glycerol. 

15 In certain cases, polymerase chain reaction (PCR) was used to confirm 

the marker content of clones. PCR conditions for each primer pair were initially 
optimized with respect to MgCfe concentration. The standard buffer was 10 
mM Tris-HCl (pH 8.3), 50 mM KCI, MgCI 2 , 0.2 mM each dNTP, 0.2 uM each 
primer, 2.7 ng/ul human DNA, 0.25 units of AmpliTaq (Perkin Elmer) and MgCI 2 

20 concentrations of 1.0 mM, 1.5 mM, 2.0 mM or 2.4 mM. Cycling conditions 
included an initial denaturation at 94°C for 2 min followed by 40 cycles at 94°C 
for 1 5 sec, 55°C for 25 sec, and 72°C for 25 sec followed by a final extension 
at 72°C for 3 min. Depending on the results from the initial round of 
optimization the conditions were further optimized if necessary. Variables 

25 included increasing the annealing temperature to 58°C or 60°C, increasing the 
cycle number to 42 and the annealing and extension times to 30 sec, and using 
AmpliTaqGold (Perkin Elmer). 

5. BAC DNA Preparation : Several different types of DNA 
preparation methods were used for isolation of BAC DNA. The manual 

30 alkaline lysis miniprep protocol listed below (Maniatis et al., 1982) was 
successfully used for most applications, i.e., restriction mapping, CHEF gel 
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analysis and FISH mapping, but was not reproducibly successful in 
endsequencing. The Autogen protocol described below was used specifically 
for BAG DNA preparation for endsequencing. 

For manual alkaline lysis BAC minipreps, bacteria were grown in 15 ml 
5 terrific broth (TB) containing 12.5 ug/ml chloramphenicol. Cultures were placed 
in a 50 ml conical tube at 37°C for 20 hr with shaking at 300 rpm. The cultures 
were centrifuged in a Sorvall RT 6000 D at 3000 rpm (1 800 x g) at 4°C for 1 5 
min. The supernatant was then aspirated as completely as possible. In some 
cases cell pellets were frozen at -20°C at this step for up to 2 weeks. The 

10 pellet was then vortexed to homogenize the cells and minimize clumping. 
Following this, 250 ul of P1 solution (50 mM glucose, 15 mM Tris-HCI, pH 8, 
10 mM EDTA, and 100 ug/ml RNase A) was added and the mixture pipetted 
up and down to mix. The mixture was then transferred to a 2 ml Eppendorf 
tube. Subsequently, 350 u) of P2 solution (0.2 N NaOH, 1 % SDS) was added, 

15 mixed gently, and the mixture was incubated for 5 min at room temperature. 
Then, 350 ul of P3 solution (3 M KOAc, pH 5.5) was added and mixed gently 
until a white precipitate formed. The solution was incubated on ice for 5 min 
and then centrifuged at 4°C in a microfuge for 10 min. 

The supernatant was transferred carefully (avoiding the white 

20 precipitate) to a fresh 2 ml Eppendorf tube, and 0.9 ml of /sopropanol was 
added; the solution was mixed and left on ice for 5 min. The samples were 
centrifuged for 10 min, and the supernatant removed carefully. Pellets were 
washed in 70% ethanol and air-dried for 5 min. Pellets were resuspended in 
200 ul of TE8 (10 mM Tris-HCI, pH 8.0, 1.0 mM EDTA, pH 8.0), and RNase 

25 (Boehringer Mannheim, http://biochem.boehringer-mannheim.com) added to 
100 ug/ml. Samples were incubated at 37°C for 30 min, then precipitated by 
addition of NH 4 OAc to 0.5 M and 2 volumes of ethanol. Samples were 
centrifuged for 10 min, and the pellets were washed with 70% ethanol. The 
pellets were air-dried and dissolved in 50 pi TE8. Typical yields for this DNA 

30 prep were 3-5 ug per 15 ml bacterial culture. Ten to 15 pi of DNA was used 
for EcoRI restriction analysis; 5 ul was used for Not) digestion and clone insert 
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sizing by CHEF gel electrophoresis. 

Autogen 740 BAC DNA preparations for endsequencing were made by 
dispensing 3 mi of LB media containing 12.5 ug/ml of chloramphenicol into 
autoclaved Autogen tubes. A single tube was used for each clone. For 
5 inoculation, glycerol stocks were removed from ~70°C storage and placed on 
dry ice. A small portion of the glycerol stock was removed from the original 
tube with a sterile toothpick and transferred into the Autogen tube. The 
toothpick was left in the Autogen tube for at least two min before discarding. 
After inoculation the tubes were covered with tape to ensure that the seal was 

10 tight. When all samples were inoculated, the tubes were transferred into an 
Autogen rack holder and placed into a rotary shaker. Cultures were incubated 
at 37°C for 16-17 hr at 250 rpm. Following this, standard conditions for BAC 
DNA preparation, as defined by the manufacturer, were used to program the 
Autogen. However, samples were not dissolved in TE8 as part of the program. 

15 DNA pellets were left dry. 

When the program was completed, the tubes were removed from the 
output tray and 30 ul of sterile distilled and deionized H 2 0 was added directly 
to the bottom of the tube. The tubes were then gently shaken for 2-5 sec and 
then covered with parafilm and incubated at room temperature for 1-3 hr. DNA 

20 samples were then transferred to an Eppendorf tube and used either directly 
for sequencing or stored at 4°C for later use. 

6. BAC Clone Characterization : DNA samples prepared either by 
manual alkaline lysis or the Autogen protocol were digested with EcoRl for 
analysis of restriction fragment sizes. These data were used to compare the 

25 extent of overlap among clones. Typically 1-2 ug were used for each reaction. 
Reaction mixtures included: 1 X Buffer 2 (NEB); 0.1 mg/ml BSA (NEB); 50 
ug/m! RNase A (Boehringer Mannheim); and 20 units of EcoRl (NEB) in a final 
volume of 25 pi. Digestions were incubated at 37°C for 4-6 hr. BAC DNA was 
also digested with A/of I for estimation of insert size by CHEF gel analysis (see 

30 below). Reaction conditions were identical to those for EcoRl, except that 20 
units of Not\ were used. Six microliters of 6 X Ficoll loading buffer containing 
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bromphenol blue and xylene cyanol was added prior to electrophoresis. 

EcoRI digests were analyzed on 0.6% agarose (Seakem, FMC 
Bioproducts, Rockland, ME) in 1XTBE containing 0.5 ug/ml ethidium bromide. 
Gels (20 cm x 25 cm) were electrophoresed in a Model A4 electrophoresis unit 
5 (Owl Scientific) at 50 volts for 20-24 hr. Molecular weight size markers 
included undigested lambda DNA, Hind\\\ digested lambda DNA, and Haelll 
digested .X1 74 DNA. Molecular weight markers were heated at 65°C for 2 min 
prior to loading the gel. Images were captured with a Kodak DC40 CCD 
camera and analyzed with Kodak 1 D software. 

10 Nofl digests were analyzed on a CHEF DRIl (Bio-Rad) electrophoresis 

unit according to the manufacturer's recommendations. Briefly, 1% agarose 
gels (Bio-Rad pulsed field grade) were prepared in 0.5 X TBE, equilibrated for 
30 min in the electrophoresis unit at 14 °C, and electrophoresed at 6 volts/cm 
for 14 hr with circulation. Switching times were ramped from 10 sec to 20 sec. 

15 Gels were stained after electrophoresis in 0.5 ug/ml ethidium bromide. 
Molecular weight markers included undigested lambda DNA, Hind\\\ digested 
lambda DNA, lambda ladder PFG ladder, and low range PFG marker (all from 
NEB). 

7. BAC Endsequencing : The sequence of BAC insert ends utilized 
20 DNA prepared by either of the two methods described above. The ends of 
BAC clones were sequenced for the purpose of filling gaps in the physical map 
and for gene discovery information. The following vector primers specific to the 
BAC vector pBACe3.6 were used to generate endsequence from BAC clones: 
pBAC 5'-2 (TGT AGG ACT ATA TTG CTC; SEQ ID NO:56) and pBAC 3'-1 
25 (CGA CAT TTA GGT GAC ACT; SEQ ID NO:57). 

The ABI dye-terminator sequencing protocol was used to set up 
sequencing reactions for 96 clones. The BigDye (ABI; PE Applied Biosystems) 
Terminator Ready Reaction Mix with AmpliTaq" FS, Part number 43031 51 , was 
used for sequencing with fluorescently labeled dideoxy nucleotides. A master 
30 sequencing mix was prepared for each primer reaction set including: 1600 pi 
of BigDye terminator mix (ABI; PE Applied Biosystems); 800 pi of 5 X CSA 
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buffer (ABi; PE Applied Biosystems); 800 pi of primer (either pBAC 5-2 or 
pBAC 3'-1 at 3.2 pM). The sequencing cocktail was vortexed to ensure it was 
well-mixed and 32 pi was aliquoted into each PCR tube. Eight microliters of 
the Autogen DNA for each done was transferred from the DNA source plate 
5 to a corresponding well of the PCR plate. The PCR plates were sealed tightly 
and centrifuged briefly to collect all the reagents. Cycling conditions were as 
follows: 1 ) 95°C for 5 min; 2) 95°C for 30 sec; 3) 50°C for 20 sec; 4) 65°C for 
4 min; 5) steps 2 through 4 were repeated 74 times; and 6) samples were 
stored at 4°C. 

10 At the end of the sequencing reaction, the plates were removed from the 

thermocycler and centrifuged briefly. Centri-Sep 96 plates were then used 
according to manufacturer's recommendations to remove unincorporated 
nucleotides, salts, and excess primers. Each sample was resuspended in 1.5 
pi of loading dye, and 1.3 pi of the mixture was loaded on ABI 377 Fluorescent 

1 5 Sequencers. The resulting endsequences were then used to develop markers 
to rescreen the BAC library for filling gaps and were also analyzed by BLASTN 
searching for EST or gene content. 

EXAMPLES: Subcloning and Sequencing of BAC RPCM1 1098L22 

The physical map of the chromosome 20 region provided the location 
20 of the BAC RPCI-1 1_1098L22 clone that contains Gene 216 (see Figure 6). 
The BAC RPCI-1 1__1098L22 clone was deposited as clone RP11-1098L22 
with the American Type Culture Collection (ATCC), 10801 University Blvd., 
Manassas, VA 20110-2209 USA, under ATCC Designation No. PTA-3171,on 
March 14, 2001 according to the terms of the Budapest Treaty. DNA 
25 sequencing of BAC, RPCl-1 1-1 098L22 from the region was completed. BAC 
RPCI-1 1 -1 098L22 DNA, (the "BAC DNA") was isolated according to one of two 
protocols: either a QIAGEN purification (QIAGEN, Inc., Valencia, CA, per 
manufacturer's instructions) or a manual purification using a method which was 
a modification of the standard alkaline lysis/Cesium Chloride preparation of 
30 plasmid DNA (see e.g., F.M. Ausubel et ai., 1997, Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, NY). Briefly, for the manual 
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protocol, cells were pelleted, resuspended in GTE (50 mM glucose, 25 mM 
Tris-CI (pH 8), 10 mM EDTA) and lysozyme (50 mg/ml solution), followed by 
addition of NaOH/SDS (1% SDS and 0.2N NaOH) and then an ice-cold 
solution of 3M KOAc (pH 4.5-4.8). RnaseA was added to the filtered 
5 supernatant, followed by treatment with Proteinase K and 20% SDS. The DNA 
was then precipitated with isopropanol, dried, and resuspended in TE (10 mM 
Tris, 1 mM EDTA (pH 8.0)). The BAC DNA was further purified by cesium 
chloride density gradient centrifugation (Ausubel et al., 1997). 

Following isolation, the BAC DNA was hydrodynamically sheared using 

10 HPLC (Hengen et al., 1997, Trends in Biochem. ScL, 22:273-274) to an insert 
size of 2000-3000 bp. After shearing, the DNA was concentrated and 
separated on a standard 1% agarose gel. A single fraction, corresponding to 
the approximate size, was excised from the gel and purified by electroelution 
(Sambrook et al., 1 989). 

15 The purified DNA fragments were then blunt-ended using T4 DNA 

polymerase. The blunt-ended DNA was then ligated to unique BsfXl-linker 
adapters (5' GTCTTCACCACGGGG (SEQ ID NO:58) and 5* GTGGTGAAGAC 
(SEQ ID NO:59) in 100-1000 fold molar excess). These linkers were 
complimentary to the BsfXl-cut pMPX vectors, while the overhang was not self- 

20 complimentary. Therefore, the linkers would not concatemerize, nor would the 
cut-vector re-ligate to itself easily. The linker-adapted inserts were separated 
from unincorporated linkers on a 1% agarose gel and purified using GeneClean 
(BIO 101, Inc., Vista, CA). The linker-adapted insert was then ligated to a 
modified pBlueScript vector to construct a "shotgun" subclone library. The 

25 vector contained an out-of-frame lacZ gene at the cloning site, which became 
in-frame in the event that an adapter-dimer was cloned. Such adapter-dimer 
clones gave rise to blue colonies, which were avoided. 

All subsequent steps were based on sequencing by ABI377 automated 
DNA sequencing methods. Major modifications to the protocols are highlighted 

30 below. Briefly, the library was transformed into DH5-competent ceils 
(GibcoBRL, DH5-transformation protocol). Transformed cells were plated onto 
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antibiotic plates containing ampicillin and IPTG/X-gal. The plates were 
incubated overnight at 37°C. White colonies were identified and then used to 
plate individual clones for sequencing. The cultures were grown overnight at 
37°C. DNA was purified using a silica bead DNA preparation method (Ng et 
5 a!., 1996, Nucl. Acids Res., 24:5045-5047). In this manner, 25 ug of DNA was 
obtained per clone. 

These purified DNA samples were sequenced using ABI dye-terminator 
chemistry. The ABI dye terminator sequence reads were run on ABI377 
machines and the data were directly transferred to UNIX machines following 

10 lane tracking of the gels. All reads were assembled using PHRAP (P. Green, 
Abstracts of DOE Human Genome Program Contractor-Grantee Workshop V, 
Jan. 1996, p. 157) with default parameters and quality scores. The assembly 
was done at 8-fold coverage and yielded 1 contig, BAC RPCM 1-1098L22. 
SEQ ID NO:5 (Figure 7) comprises a portion of the BAC that includes the 

15 genomic sequence of Gene 216. 
EXAMPLE 6: Gene Identification 

Any gene or EST mapping to the interval based on public map data or 
proprietary map data was considered a candidate respiratory disease gene. 
Public map data were derived from several sources: the Genome Database 

20 (GDB, http://gdbwww.gdb.org/), the Whitehead Institute Genome Center 
(http://www-genome.wi.mit.edu/), GeneMap98, UniGene, OMIM, dbSTS and 
dbEST (NCBI, http://www.ncbi.nlm.nih.gov/), the Sanger Centre 
(http://www.sanger.ac.uk/), and the Stanford Human Genome Center 
(http://www-shgc.stanford.edu/). Proprietary data was obtained from 

25 sequencing genomic DNA (cloned into BACs) or cDNAs (identified by direct 
selection, screening of cDNA libraries or full length sequencing of IMAGE 
Consortium (http://www-bio.1 1 nl.gov/bbrp/image.html) cDNA clones). 

1 . Gene Identification from clustered DNA fragments . DNA 
sequences corresponding to gene fragments in public databases (GenBank 

30 and human dbEST) and proprietary cDNA sequences (IMAGE consortium and 
direct selected cDNAs) were masked for repetitive sequences and clustered 
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using the PANGEA Systems (Oakland, CA) EST clustering tool. The clustered 
sequences were then subjected to computational analysis to identify regions 
bearing similarity to known genes. This protocol included the following steps: 
a. The clustered sequences were compared to the publicly available 
5 UniGene database (NCBI) using the BLASTN2 algorithm (Altschul et al., 1997). 
The parameters for this search were: E = 0.05, v = 50, B = 50, where E was 
the expected probability score cutoff, V was the number of database entries 
returned in the reporting of the results, and B was the number of sequence 
alignments returned in the reporting of the results (Altschul et al., 1990). 
10 b. The clustered sequences were compared to the GenBank database 

(NCBI) using BLASTN2 (Altschul et al„ 1997). The parameters for this search 
were E=0.05, V=50, B= 50, where E, V, and B were defined as above. 

c. The clustered sequences were translated into protein sequences for 
all six reading frames, and the protein sequences were compared to a non- 
15 redundant protein database compiled from GenPept Swissprot PIR (NCBI). 

The parameters for this search were E = 0.05, V = 50, B = 50, where E, V, and 
B were defined as above. 

d. The clustered sequences were compared to BAC sequences (see 
below) using BLASTN2 (Altschul et al., 1997). The parameters for this search 

20 were E=0.05, V=50, B=50, where E, V, and B were defined as above. 

2. Gene Identification from BAC Genomic Sequence : Following 
assembly of the BAC sequences into contigs, the contigs were subjected to 
computational analyses to identify coding regions and regions bearing DNA 
sequence similarity to known genes. This protocol included the following steps: 

25 a. Contigs were degapped. The sequence contigs often contained 

symbols (denoted by a period symbol) that represented locations where the 
individual ABI sequence reads had insertions or deletions. Prior to automated 
computational analysis of the contigs, the periods were removed. The original 
data were maintained for future reference. 

30 b. BAC vector sequences were "masked" within the sequence by using 

the program crossmatch (P. Green, http:\\chimera.biotech. Washington. 
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edu\UWGC). Since the shotgun library construction detailed above left some 
BAC vector in the shotgun libraries, this program was used to compare the 
sequence of the BAC contigs to the BAC vector and to mask any vector 
sequence prior to subsequent steps. Masked sequences were marked by "X" 
5 in the sequence files, and remained inert during subsequent analyses. 

c. E. coli sequences contaminating the BAC sequences were masked 
by comparing the BAC contigs to the entire E. coli DNA sequence. 

d. Repetitive elements known to be common in the human genome 
were masked using CROSSMATCH (P. Green, University of Washington). In 

10 this implementation of crossmatch, the BAC sequence was compared to a 
database of human repetitive elements (J. Jerka, Genetic Information 
Research Institute, Palo Alto, CA). The masked repeats were marked by "X" 
and remained inert during subsequent analyses. 

e. The location of exons within the sequence was predicted using the 
1 5 MZEF computer program (Zhang, 1997, Proc. Natl. Acad. Sc/., 94:565-568)and 

GenScan gene prediction program (Burge and Karlin, J. Mol. B/o/., 268:78-94). 

f. The sequence was compared to the publicly available UniGene 
database (NCBI) using the BLASTN2 algorithm (Altschul et al., 1997). The 
parameters for this search were: E = 0.05, v = 50, B = 50, where E was the 

20 expected probability score cutoff, V was the number of database entries 
returned in the reporting of the results, and B was the number of sequence 
alignments returned in the reporting of the results (Altschul et al., 1990). 

g. The sequence was translated into protein sequences for all six 
reading frames, and the protein sequences were compared to a non-redundant 

25 protein database compiled from GenPept Swissprot PIR (NCBI). The 
parameters for this search were E = 0.05, V = 50, B = 50, where E, V, and B 
were defined as above. 

h. The BAC DNA sequence was compared to a database of clustered 
sequences using the BLASTN2 algorithm (Altschul et al., 1997). The 

30 parameters for this search were E=0.05, V=50, B=50, where E, V, and B were 
defined as above. The database of clustered sequences was prepared utilizing 
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a proprietary clustering technology (PANGEA Systems, Inc.) using cDNA 
clones derived from direct selection experiments (described below), human 
dbEST sequences mapping to the 20p13-p12 region, proprietary cDNAs, 
GenBank genes, and IMAGE consortium cDNA clones. 
5 i. The BAC sequence was compared to the sequences derived from the 

ends of BACs from the region on chromosomes 20 using the BLASTN2 
algorithm (Altschul et al. ( 1997). The parameters for this search were E=0.05, 
V=50, B= 50, where E, V, and B were defined as above. 

j. The BAC sequence was compared to the GenBank database (NCBl) 
10 using the BLASTN2 algorithm (Altschul et al., 1997). The parameters for this 
search were E = 0.05, V = 50, B = 50, where E, V, and B were defined as 
above. 

k. The BAC sequence was compared to the STS division of GenBank 
database (NCBl) using the BLASTN2 algorithm (Altschul et al., 1997). The 
15 parameters for this search were E=0.05, V=50, B= 50, where E, V, and B were 
defined as above. 

I. The BAC sequence was compared to the Expressed Sequence Tag 
(EST) GenBank database (NCBl) using the BLASTN2 algorithm (Altschul et al., 
1997). The parameters for this search were E=0.05, V=50, B= 50, where E, 
20 V, and B were defined as above. 

c. Mapping Analysis 

Through mapping analysis, BAC RPCI-1 1_1098L22 (ATCC 
Designation No. PTA-3171) was identified as containing Gene 216. This 
BAC sequence (SEQ ID NO:5, Figure 7) included the genomic sequence of 

25 Gene 216 (SEQ ID NO:6; Figure 29), which corresponded to the cDNA 
sequence of Gene 216 (SEQ ID NO:1; Figure 24). 
EXAMPLE 7: Gene 216 cDNA Cloning and Expression Analysis 

1. Construction and screening of cDNA libraries : Directionally 
cloned cDNA libraries from normal lung and bronchial epithelium were 

30 constructed using standard methods (Soares et. a!., 1994, Automated DNA 
Sequencing and Analysis, Adams et al. (eds), Academic Press, NY, pp. 110- 
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1 14). Total and cytoplasmic RNAs were extracted from tissue or cells by 
homogenizing the sample in the presence of Guanidinium Thiocyanate- 
Phenol-Chloroform extraction buffer (e.g. Chomczynski and Sacchi, 1987, 
Anal. Biochem., 162:156-159) using a poiytron homogenizer (Brinkman 
5 Instruments, http://www.brinkmann.com). Poly A + RNA was isolated from 
total/cytoplasmic RNA using dynabeads-dT according to the manufacturer's 
recommendations (Dynal, Inc., http://www.dynal.com). The double stranded 
cDNA was then ligated into the plasmid vector pBluescript II KS+ 
(Stratagene, http://www.stratagene.com), and the ligation mixture was 

10 transformed into E. cqli host DH10B or DH12S by electro po ration (Soares, 
1994). Following overnight growth at 37°C, DNA was recovered from the E. 
coli colonies after scraping the plates by processing as directed for the 
Mega-prep kit (QIAGEN). The quality of the cDNA libraries was estimated 
by counting a portion of the total number of primary transformants, 

15 determining the average insert size, and the percentage of piasmids with no 
cDNA insert. Additional cDNA libraries (human total brain, heart, kidney, 
leukocyte, and fetal brain) were purchased from Life Technologies 
(Bethesda, MD). 

cDNA libraries, both oligo (dT) and random hexamer-primed, were 
20 used for isolating cDNA clones mapped within the disorder critical region. 
Four 10x10 arrays of each of the cDNA libraries were prepared as follows. 
The cDNA libraries were titered to 2.5 x 10 6 using primary transformants. 
The appropriate volume of frozen stock was used to inoculate 2 L of 
LB/ampicillin (100 ug/ul). Four hundred aliquots containing 4 ml of the 
25 inoculated liquid culture were generated. Each tube contained about 5000 
cfu (colony forming units). The tubes were incubated at 30°C overnight with 
shaking until an OD of 0.7-0.9 was obtained. Frozen stocks were prepared 
for each of the cultures by aliquotting 300 ul of culture and 100 ul of 80% 
glycerol. Stocks were frozen in a dry ice/ethanol bath and stored at -70°C. 
30 DNA was isolated from the remaining culture using the QIAGEN spin mini- 
prep kit according to the manufacturer's instructions. The DNA from the 400 
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cultures were pooled to make 80 column and row pools. Markers were 
designed to amplify putative exons from candidate genes. Once a standard 
PCR condition was identified and specific cDNA libraries were determined 
to contain cDNA clones of interest, the markers were used to screen the 
5 arrayed library. Positive addresses indicating the presence of cDNA clones 
were confirmed by a second PCR using the same markers. 

Once a cDNA library was identified as likely to contain cDNA clones 
corresponding to a transcript of interest from the disorder critical region, it 
was used to isolate a clone or clones containing cDNA inserts. This was 

10 accomplished by a modification of the standard "colony screening" method 
(Sambrook et al., 1989). Specifically, twenty 150 mm LB plus ampicillin agar 
plates were spread with 20,000 cfu of cDNA library. Colonies were allowed 
to grow overnight at 37° C. Colonies were then transferred to nylon filters 
(Hybond from Amersham-Pharmacia, or equivalent) and duplicates prepared 

15 by pressing two filters together essentially as described (Sambrook et al., 
1989). The "master" plate was then incubated an additional 6-8 hr to allow 
the colonies additional growth. The DNA from the bacterial colonies was 
then bound to the nylon filters by treating the filters sequentially with 
denaturing solution (0.5 N NaOH, 1.5 M NaCI) for 2 min, and neutralization 

20 solution (0.5 M Tris-CI pH 8.0, 1.5 M NaCI) for 2 min (twice). The bacterial 
colonies were removed from the filters by washing in a solution of 2 X 
SSC/2% SDS for 1 min while rubbing with tissue paper. The filters were air- 
dried and baked under vacuum at 80°C for 1-2 hr to crosslink the DNA to the 
filters. 

25 cDNA hybridization probes were prepared by random hexamer 

labeling (Fineberg and Vogelstein, 1983, Anal. Biochem., 132:6-13) or by 
including gene-specific primers and no random hexamers in the reaction (for 
small fragments). The colony membranes were then pre-washed in 10 mM 
Tris-CI pH 8.0, 1 M NaCI, 1 mM EDTA, 0.1% SDS for 30 min at 55°C. 

30 Following the pre-wash, the filters were pre-hybridized in > 2 ml/filter of 6 X 
SSC, 50 % deionized formamide, 2% SDS, 5 X Denhardt's solution, and 100 
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mg/ml denatured salmon sperm DNA, at 42°C for 30 min. The filters were 
then transferred to hybridization solution (6 X SSC, 2% SDS, 5 X 
Denhardt's, 100 mg/ml denatured salmon sperm DNA) containing denatured 
a- 32 P-dCTP-labeled cDNA probe and incubated overnight at 42°C. 
5 The following morning, the filters were washed under constant 

agitation in 2 X SSC, 2% SDS at room temperature for 20 min, followed by 
two washes at 65°C for 1 5 min each. A second wash was performed in 0.5 
X SSC, 0.5% SDS for 15 min at 65°C. Filters were then wrapped in plastic 
wrap and exposed to radiographic film. Individual colonies on plates were 

10 aligned with the autoradiograph and positive clones picked into a 1 ml 
solution of LB Broth containing ampicillin. After shaking at 37°C for 1-2 hr, 
aliquots of the solution were plated on 150 mm plates for secondary 
screening. Secondary screening was identical to primary screening (above) 
except that it was performed on plates containing -250 colonies so that 

15 individual colonies could be clearly identified. Positive cDNA clones were 
characterized by restriction endonuclease cleavage, PCR, and direct 
sequencing to confirm the sequence identity between the original probe and 
the isolated clone. 

To obtain the full-length cDNA, novel sequence from the 5'-end of the 

20 clone was used to rep robe the library. This process was repeated until the 
length of the cDNA cloned matched that of the mRNA, estimated by 
Northern analysis. Utilizing this process, a single uterus clone was isolated 
and deposited as clone Gene 216_CS759 with the American Type Culture 
Collection (ATCC), 10801 University Blvd., Manassas, VA 20110-2209 USA, 

25 under ATCC Designation No. PTA-3173, on March 14, 2001, according to 
the terms of the Budapest Treaty. The uterus clone (SEQ ID NO:3) 
contained the entire Gene 216 open reading frame. Both strands of this 
clone were completely sequenced and the data were compared against the 
BAC sequence. Any discrepancies were flagged, and these regions were 

30 resequenced. The final analysis of the sequence revealed that the uterine 
clone was 3433 bp long and contained the full complement of exons defining 
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the open reading frame (SEQ ID NO:3). In addition, the clone contained a 
small portion of the 5' untranslated region (5 bp), the entire 3' untranslated 
region with a polyadenylation signal, and a poly A tail of 76 bp in length. 
The Gene 216 open reading frame was determined to be 2436 bp in length 
5 and to encode a protein of 812 amino acids (SEQ ID NO:363) . Analysis of 
the composition of SNPs across the cDNA clone revealed that it contained 
the most frequent haplotype (Figure 8, see below). 

Rapid Amplification of cDNA ends (RACE) was performed following 
the manufacturer's instructions using a Marathon cDNA Amplification Kit 

10 (CLONTECH) as a method for cloning the 5' and 3' ends of candidate 
genes. cDNA pools were prepared from total RNA by performing first strand 
synthesis. For first strand synthesis, a sample of total RNA sample was 
mixed with a modified oligo (dT) primer, heated to 70° C, cooled on ice and 
incubated with: 5 X first strand buffer (CLONTECH), 10 mM dNTP mix, and 

15 AMV Reverse Transcriptase (20 U/ul). The reaction mixture was incubated 
at 42°C for 1 hr and placed on ice. For second-strand synthesis, the 
following components were added directly to the reaction tube: 5 X second- 
strand buffer (CLONTECH), 10 mM dNTP mix, sterile water, and 20 X 
second-strand enzyme cocktail (CLONTECH). The reaction mixture was 

20 incubated at 16°C for 1.5 hr. T4 DNA Polymerase was added to the 
reaction mixture and incubated at 16°C for 45 min. The second-strand 
synthesis was terminated with the addition of an EDTA/Glycogen mix. The 
sample was purified by phenol/chloroform extraction and ammonium acetate 
precipitation. The cDNA pools were checked for quality by analyzing on an 

25 agarose gel for size distribution. Marathon cDNA adapters were then ligated 
onto the cDNA ends. The specific adapters contained priming sites that 
allowed for amplification of either 5' or 3' ends, and varied depending on the 
orientation of the gene specific primer (GSP) that was chosen. An aliquot 
of the double stranded cDNA was added to the following reagents: 1 0 uM 

30 Marathon cDNA adapter, 5 X DNA ligation buffer, T4 DNA ligase. The 
reaction was incubated at 16°C overnight and heat inactivated to terminate 
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the reaction. PCR was performed by the addition of the following to the 
diluted double stranded cDNA pool: 10X cDNA PCR reaction buffer, 10 uM 
dNTP mix, 10 uM GSP, 10 uM AP1 primer (kit), 50 X Advantage cDNA 
Polymerase Mix. Thermal Cycling conditions were carried out at 94°C for 
5 30 sec; 5 cycles of 94°C for 5 sec, 72°C for 4 min, 5 cycles of 94°C for 5 
sec, and 70°C for 4 min; 23 cycles of 94°C for 5 sec; 68°C for 4 min. The 
first round of PCR was performed using the GSP to extend to the end of the 
adapter to create the adapter primer-binding site. Following this, 
exponential amplification of the specific cDNA of interest was performed. 

10 Usually, a second, nested PCR was performed to provide specificity. The 
RACE product was analyzed on an agarose gel. Following excision from the 
gel and purification (GeneClean, BIO 101), the RACE product was then 
cloned into pCTNR (Genera! Contractor DNA Cloning System, 5' - 3', Inc.) 
and sequenced to verify that the clone was specific to the gene of interest. 

15 - The 5' RACE technique was employed to identify the 5' untranslated 

region of Gene 216. Experiments were performed using lung mRNA and a 
primer that hybridized near the 5' end of the available sequence. The result of 
the experiment identified an additional 75 bp 5' of that present in the uterus 
cDNA clone (rt690; SEQ ID NO:351 ). This sequence was subsequently cloned 

20 and deposited with the ATCC (American Type Culture Collection, 10801 
University Blvd., Manassas, VA 20110-2209 USA), as clone Gene 216_rt690, 
under ATCC Designation No.PTA-3172 on March 14, 2001, according to the 
terms of the Budapest Treaty. 

Further attempts to extend the 5' end of Gene 216 by 5' RACE gave 

25 similar results indicating that the 5' end of the transcript was obtained. 

This sequence in combination with the uterus cDNA clone yielded the 
master consensus sequence containing the 5' to 3' cDNA for Gene 216 (SEQ 
ID NO:1; Figure 24). 

2. Identification of Splice Variants : Additional cDNA clones were 

30 isolated that represented alternatively spliced variants of Gene 216. To ensure 
that all splice variants present in lung tissue were identified, an RT-PCR-based 
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screening protocol was designed using multiple primer pairs spanning the 
entire gene. These amplicons produced PCR fragments of approximately 600 
bp and overlapped by approximately 100 bp. The PCR products were 
fractionated on agarose gels and any fragments that were different from the 
5 expected size were cloned and sequenced. These results are summarized in 
Figures 9 and 10. The availability of the complete genomic sequence of BAC 
RPCI-11_1098L22 enabled the intron/exon structure of Gene 216 (Figure 11) 
to be determined. Gene 216 contains 21 exons that span approximately 23.5 
kb of genomic DNA. 

10 Analysis of the sequence surrounding the intron/exon boundaries 

indicated that the consensus splice sequence GT/AG was upheld in all cases 
(Table 4). However, in several of the cDNA clones, an alternative use of a 
splice site at the intron/exon boundary of exon T was identified. The sequence 
CAGCAG was present at the border of intron ST and exon T resulting in a 

15 duplication of the canonical acceptor splice consensus CAG. Typically, a C 
residue preceding the AG is found in approximately 65% of acceptor splice 
sites. As a consequence, the splicing machinery can utilize either AG resulting 
in the presence or absence of an alanine. If the first AG (splice site 1) were 
utilized near the junction of intron ST and exon T, the resulting protein would 

20 encode the amino acid sequence DPQADQVQM (Figure 12) (SEQ ID NO:60). 
However, if the second AG (splice site 2) were favored, then one alanine 
would be omitted from the amino acid sequence and the protein would contain 
the amino acid sequence DPQDQVQM (Figure 12) (SEQ ID NO:61). The 
percentage that used splice site 1 or splice site 2 could not be determined from 

25 the dataset because the majority of the clones were derived from PCR-based 
techniques. 
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TABLE 4 



EXON 


3'INTRON 


5' EXON 


3* EXON 


5" INT RON 


A 






AAG 


GTGAGG 


B 


CAG 


GAC 


CCG 


GTCAGT 


C 


CAG 


GTC 


CCA 


GTGAGT 


D 


CAG 


CAG 


ACG 


GTGAGA 


D (ALT) 


CAG 


CAG 


GAG 


GTACCC 


E 


TAG 


GAT 


GAG 


GTGAGC 


F 


TAG 


TGG 


AGG 


GTCAGG 


G 


CAG 


GGC 


CTG 


GTGAGG 


H 


CAG 


TTC 


CAG 


GTTGGG 


1 


CAG 


CTT 


CAC 


GTGGGT 


J 


CAG 


GGG 


ACG 


GTGAGC 


K 


CAG 


GAC 


CGG 


GTACGC 


L 


TAG 


GCA 


CAG 


GTTAAG 


L2 


CAG 


GAG 


CTG 


GTGAGG 


M 


CAG 


CTG 


CTG 


GTGAGA 


N 


CAG 


GCT 


GAG 


GTAGGG 


O 


CAG 


GGA 


ATG 


GTGAGC 


0 (ALT) 


TAG 


ATG 


ATG 


GTGAGC 


U 


TAG 


GTG 


GGG 


GTGAGA 


P 


CAG 


GTT 


AAA 


GTATGC 


Q 


CAG 


ACC 


TGG 


GTAGGC 


R 


CAG 


CCC 


TGG 


GTGAGT 


S 


CAG 


ACC 


AAG 


GTAGGC 


T 


CAG 


CAG 
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3. Promoter Analysis : In order to identity the transcriptional start site 
of Gene 216, multiple 5' RACE products were sequenced from several different 
5 tissues. In most cases the 5' ends were located 80 bp upstream of the 
translational start site. The region upstream of this sequence was then 
analyzed for potential transcription factor binding sites using GEMS Launcher, 
a promoter analysis program (http://anthea.gsf.de/). GEMS Launcher uses 
statistically weighted algorithms to identify binding elements that comprise a 

1 0 promoter or regulatory module. A stretch of DNA sequence spanning the 2000 
bp upstream of the translational start site was analyzed. The results indicated 
that Gene 216 did not possess a TATA or CCAAT box. In fact, the first binding 
element that was identified was a GC box within the 5' untranslated region 
oriented in the opposite direction (Figure 13). This result is not unprecedented 

15 since 60% of TATA-less genes possess a GC box on the opposing strand. 
Also, this result was in agreement with published data regarding the promoters 
of mouse ADAM 17 and 19. Other binding elements that were identified within 
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600 bp upstream of the initiator methionine included an E-box, one AP2, and 
three SP1 sites (Figure 13). These types of binding elements were also 
identified in the mouse ADAM 17 and 19 genes and may represent 
components of a promoter module for Gene 216. Approximately 1200 bp 
5 upstream of the putative promoter module, GEMS Launcher identified binding 
elements that may comprise an additional regulatory element (Figure 13). This 
region was highly conserved with the mouse ortholog of Gene 216 (see below), 
as determined by dot matrix analysis. 

4. BLAST Analysis : B LAS TP, BLASTN, and BLASTX analysis of 

10 Gene 216 against protein and nucleotide databases revealed that it was a 
novel member of the ADAM (A Disintegrin And Metalloprotease) gene family. 
This gene family, of which there are currently 31 members, is a sub-group of 
the zinc-dependent metalloprotease superfamily. ADAMs have a complex 
domain organization that includes a signal sequence, propeptide, 

15 metalloprotease, disintegrin, cysteine- rich, and epidermal growth factor-like 
domains, as well as a transmembrane region and cytoplasmic tail. ADAM 
proteins have been implicated in many processes such as proteolysis in the 
secretory pathway and extracellular matrix, extra- and intra-cellular signaling, 
processing of plasma membrane proteins and procytokine conversion. The 

20 homology of Gene 216 and human ADAMs 19, 12, 15, 8 and 9 indicated that 
Gene 216 belonged to a branch of the 31 -member family containing active 
metalloprotease domains (Figure 14). 

6. Expression Analysis : To characterize the expression of Gene 216, 
a series of expression experiments were performed. 

25 i. Northern Analysis : To characterize novel genes, Northern 

analysis (Sambrook et al. r 1989) can be used to determine the length, in 
nucleotides, of the processed transcript or messenger RNA (mRNA). Probes 
were generated using one of the methods described below. Briefly, sequence 
verified IMAGE consortium cDNA clones were digested with appropriate 

30 restriction endonucleases to release the insert. The restriction digest was 
electrophoresed on an agarose gel and the bands containing the insert were 
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excised. The gel piece containing the DNA insert was placed in a Spin-X 
(Corning Costar Corporation, Cambridge, MA) or Supelco spin column 
(Supelco Park, PA) and spun at high speed for 15 min. The DNA was ethanoi 
precipitated and resuspended in TE. Alternatively, PCR products obtained 
5 from genomic DNA or RT-PCR were purified. First, oligonucleotide primers 
were designed for use in the polymerase chain reaction (PCR) so that portions 
of the cDNA, EST, or genomic DNA could be amplified from a pool of DNA 
molecules or RNA population (RT-PCR). The PCR primers were used in a 
reaction containing genomic DNA to verify that they generated a product of the 

10 predicted size (based on the genomic sequence. Inserts purified from IMAGE 
clones or PCR products were random primer labeled (Fineberg and Vogelstein, 
supra) to generate probes for hybridization. Probes from purified PCR 
products were generated by incorporation of a- 32 P-dCTP in second round of 
PCR. Commercially available Multiple Tissue Northern blots (CLONTECH) 

15 were hybridized and washed under conditions recommended by the 
manufacturer. A separate filter that contained 6 tissues from the immune 
system was also utilized. The results revealed a major 5.0 kb transcript and 
a minor 3.5 kb transcript that were expressed in most tissues examined 
(Figures 15A-15B). The strongest signals were consistently identified in heart, 

20 skeletal muscle, colon, lymph, and small intestine, with lung, liver, kidney, 
placenta, bone marrow, and brain showing moderate expression levels. 

The 5 kb transcript was further analyzed to determine if it was an 
incompletely spliced version of the Gene 216 transcript. To test this 
hypothesis, Northern blotting was performed using cytoplasmic mRNA isolated 

25 from bronchial smooth muscle cells. The same radioactive probe was 
employed as previously. The results showed a veiystrong 3.5 kb signal and 
no signal at 5.0 kb (Figure 15C) suggesting that the predominant 5 kb 
transcript contained intronic material and was localized to the nucleus. 
Interestingly, intron QR is 1.4 kb in size. The addition of the QR intron and the 

30 3.5 kb full length cDNA would total -5.0 kb. Accordingly, there may be 
regulatory elements within the region around intron QR that affect splicing, 
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retention in the nucleus, and/or transport to the cytoplasm. 

ii. RNA Dot Blot Analysis : RNA dot blotting was used to 
determine the expression of Gene 216 in a wide range of tissues. mRNA 
from 50 tissues was dotted onto a nylon filter, and a radioactive probe 

5 designed to hybridize to the 3' untranslated region was used. Figure 16 
shows that Gene 216 was highly expressed in gastrointestinal tissues as 
well as aorta, uterus, prostate, ovary, lung, fetal lung, trachea and placenta. 
Notably, the majority of these tissues are derived from the endoderm, which 
forms a tube that produces the primordium of the digestive tract. Extensions 
10 from this wall also develop into organs such as the lung and trachea. 

iii. RT-PCR : Total RNA isolated from primary cultures of seven cell 
types cultured from lung tissue was analyzed in RT-PCR experiments. 
Genomic DNA was removed from the total RNA by DNasel digestion. The 
"Superscript 1 Preamplification System for First strand cDNA synthesis" (Life 

15 Technologies) was used according to manufacturer's specifications with 
oligo(dT) or random hexamers to synthesize cDNA from the DNasel treated 
total RNA. Gene specific primers were used to amplify the target cDNAs in a 
30 PCR reaction containing 0.5 /il of first strand cDNA, 1 jul sense primer (10 
HM), 1 ul antisense primer (10 p,M), 3 ul dNTPs (2 mM), 1.2 u! MgCI 2 (25 mM), 

20 3 ul 10 X PCR buffer and 1 unit of Taq Polymerase (Perkin Elmer). The PCR 
reaction was initially incubated at 94°C for 4 min, followed by 30 cycles of 
incubation at 94°C for 30 sec, 58°C for 1 min, and 72°C for 1 min; then 
followed by a final incubation at 72° C for 7 min. PCR products were analyzed 
on agarose gels. Figure 17 shows that Gene 216 was expressed in lung 

25 fibroblasts, pulmonary artery smooth muscle cells, bronchial smooth muscle 
cells and total lung, but not in bronchial epithelium or pulmonary artery 
endothelial cells. 

iv. cDNA Library Representation : A comprehensive approach 
to determining the tissue distribution of Gene 216 was performed in silico by 

30 mining the public EST database and Genome Therpaeutics Corporation's 
internal cDNA database. BLAST analysis identified ESTs from multiple cDNA 
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libraries. A summary of all tissues expressing Gene 216 is given in Table 5. 

TABLE 5 



Source 


Tissue 


UNIGENE 


Eye 




Muscle 




Placenta 




Stomach 




Uterus 




Whole embryo 




Breast 




Normal testis 


Direct selected cDNAs 


Bronchial smooth muscle (1 clone) 




Normal lung (2 clones) 




Brain (1 clone) 


Primary cell types (RT/PCR) 


Pulmonary artery smooth muscle 




Bronchial smooth muscle 




Lung fibroblast 




Total lung 


RNA Dot Blot 


Aorta 




Colon 




Bladder 




Uterus 




Prostate 




Ovary 




Small intestine 




Heart 




Stomach 




Testis 




Appendix 




Lung 




Trachea 




Fetal kidney 




Fetal lung 


Northern Blot 


Brain 




Heart 




Skeletal muscle 




Colon 




Thymus 




Spleen 




Kidney 




Liver 




Small intestine 




Placenta 




Lung 




Lymph 




Bone marrow 
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EXAMPLE 8: Gene 216 Polypeptide 

1 . ADAM Family Features: The zinc-dependent metalioprotease 
superfamiiy is comprised of several sub-groups. Those proteases that exhibit 
5 the characteristic Zn-binding consensus sequence HEXXHXXGXXH (SEQ ID 
NO: 62) are referred to as zincins. The 3 histidines play an essential role in 
binding to the catalytically essential zinc ion. The zincins can be further 
classified into metzincins if a methionine residue is located beneath the active- 
site zinc ion ("Met-turn" motif). Within this sub-group there are 4 sub-families: 

10 astacins, matraxins, adamlysins, and serralysins. The ADAM genes fall within 
the adamlysins sub-family along with snake venom metalloproteases. 

Currently, there are 31 members of the ADAM family. The ADAM genes 
encode proteins of approximately 750 amino acids with 8 different domains. 
Domain I is a pre-domain and contains the signal sequence peptide that 

15 facilitates secretion through the plasma membrane. Domain II is a pro-domain 
that is cleaved before the protein is secreted resulting in activation of the 
catalytic domain. Domain III is a catalytic domain containing metalioprotease 
activity. Domain IV is a disintegrin-like domain and is believed to interact with 
integrins or other receptors. Domain V is a cysteine-rich domain and is 

20 speculated to be involved in protein-protein interactions or in the presentation 
of the disintegrin-like domain. Domain VI is an EGF-like domain that plays a 
role in stimulating membrane fusion. Domain VII is a transmembrane domain 
that anchors the ADAM protein to the membrane. Domain VIII is a cytoplasmic 
domain and contains binding sites for cytoskeletal-associated proteins and/or 

25 SH3 binding domains that may play a role in bi-directional signaling. See 
Figure 8 for the location of ADAM domains identified in the Gene 216 protein 
sequence. 

To determine whether Gene 216 was a novel member of the ADAM 

family, the 812 amino acid sequence was aligned by Pile-Up (Genetics 

30 Computer Group, http://www.gcg.com) (Figure 18). These analyses indicated 

that Gene 216 possessed the characteristic consensus sequence 

HEXXHXXGXXH (SEQ ID NO:62) located within the catalytic domain. In 
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addition, a methionine residue referred to as a "Met-turn" was identified in the 
Gene 216 protein. A conserved cysteine (amino acid 133 in Gene 216) that 
plays a role in activating ADAM proteins was identified in the prodomain of 
Gene 216 protein. In ADAM proteins, this single cysteine residue forms an 
5 intramolecular complex with the zinc ion bound to the metailoprotease domain 
and blocks the active site. The catalytic domain is activated by the dissociation 
of the cysteine from the complex, resulting in either a conformational change 
or enzymatic cleavage of the prodomain. This process is referred to as the 
"cysteine switch". 

10 In ADAM 12, the position of the cysteine residue was reported to be 

located in a different position in the prodomain (B.L. Gilpin et al., 1998, J. BioL 
Chem. 273:157-166). This location would correspond to the cysteine residue 
at amino acid 179 in Gene 216 (Figure 19). However, in accordance with 
analyses performed by Stone et al., using 14 ADAMs, including ADAMs 8, 9, 

15 12 and 15, the cysteine residue corresponding to position 133 of Gene 216 
(Figures 18 and. 19) was identified as being involved in the "cysteine switch". 
In addition, there appeared to be more sequence identity around the cysteine 
at amino acid 133 in Gene 216 than at position 179. This provided further 
support that the cysteine at position 133 was involved in the "cysteine switch". 

20 The alignment also indicated that the amino acid sequence of Gene 216 
contained all eight domains that define the hallmarks of these types of genes 
(Figure 18). 

Hydrophobicity analysis (PepPlot, Genetics Computer Group) of the 
Gene 216 amino acid sequence revealed the presence of two hydrophobic 

25 regions (Figure 20). One region is located at the amino terminus of the protein 
and is the putative the signal sequence. The other hydrophobic region is 
located near the carboxyl terminus and is the putative transmembrane domain 
that anchors the protein to the cell surface. Computational biology analysis 
(http://blocks.fhcrc.org) of the Gene 216 cytoplasmic domain revealed the 

30 presence of a putative SH2 and SH3 binding domain as well as a putative 
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casein kinase I phosphorylation site (Figure 19). These sites may contribute 
to a role in bi-directional signaling, a function attributed to ADAM proteins. 

Sequence analyses indicated that Gene 216 is a novel member of the 
ADAM family. Gene 21 6 is most closely related to ADAMs 8, 9, 12, 15, and 19, 
5 a branch of the family that is known to possess an active metalloprotease 
domain. Table 6 lists the 5 most similar BLASTP hits using the Gene 216 
amino acid sequence as a query. Based on BLASTN and BLASTP analysis, 
Gene 216 nucleotide sequence shares the 37% identity with the ADAM 19 
nucleotide sequence; and Gene 216 amino acid sequence shares 58% identity 
10 with the ADAM 19 amino acid sequence. 



Table 6: Top 5 Hits from BLAST Analysis of Gene 216 protein 



15 



H1L 



Gen Bank Locus Description 



Smallest Sum 



U66003 



Xenopus laevis (ADAM 13) 



5.5e-166 



2 


AF019887 


Mus musculus metal lop rotease- 
disintecrrin meitrin beta 


1.2e-139 


3 


AF1 34707 


Homo sapiens disintegrin and 
metalloprotease domain 19 (ADAM 19) 


1.6e-139 


4 


S60257 


Mouse mRNA for meitrin alpha 


1.8e-121 


5 


AF023476 


Homo sapiens meltrin-L precursor 
(AD AM 12) 


4.9e-119 



25 



30 



Table 7 lists the top two hits from BLIMPS analysis of the Block protein 
motif database (http://blocks.fhcrc.Org/V 

Table 7: Top 2 Hits from BLIMPS Analysis of Gene 216 protein 



Description 



Strength Score AA# AA 



Disintegrins proteins 1950 1597 377 



Sequence 



CCfAhnCsLRPGAQCAh- 
GdCCvRCIlKpAGal- 
CRqAMGDCDlPEfCT- 
GTSshCPP (SEQ ID NO:335) 



Zinc metallopeptidases 1 1 73 1276 276 TMAHEIGHSLG (SEQ ID NO:336) 
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2. Amino Acid Changes : In total, there were 9 SNPs within the open 
reading frame of Gene 216. See Example 10 for details on polymorphism 
identification and Figure 19 for resulting changes to the protein sequence. 
5 Seven of the nine SNPs constituted an amino acid change and the other 2 
were synonymous. Of the 7 amino acid changes, 4 were clustered toward the 
carboxyl terminus of the protein: one within the identified transmembrane 
domain and 3 within the identified cytoplasmic domain. 

One SNP located in an identified SH2 binding domain resulted in a 

1 0 significant amino acid change: methionine (hydrophobic) to threonine (polar). 
The remaining two SNPs in the identified cytoplasmic domain resulted in 
significant amino acid changes: proline (hydrophobic) to serine (polar) and 
glutamine (polar) to histidine (basic). These amino acid changes may disturb 
the signaling properties of the Gene 216 protein. In addition, the valine to 

15 isoleucine amino acid change in the putative transmembrane domain may 
affect signaling efficiency. 

The two SNPs in the identified pro-domain generated significant amino 
acid changes: tyrosine (poiar) to histidine (basic) and threonine (polar) to 
alanine (hydrophobic). Since the ADAM pro-domain is cleaved during 

20 activation of the catalytic domain, it is possible that these amino acid changes 
affect the cleavage process. One SNP in the identified catalytic domain 
resulted in a change from alanine (hydrophobic) to valine (hydrophobic). This 
amino acid change may affect sheddase efficiency. 

Notably, amino acid changes in the identified Gene 216 catalytic 

25 domain, especially within the metalloprotease domain, would be of great 
interest, as this domain is critical to sheddase function. Recently, the X-ray 
crystal lographic data of the snake venom catalytic domain was determined and 
deposited in the public domain (http://www.rcsb.org/pdb/cgi/explore.cgi? 
pid=9267984771616&pdbld=1C9G; Accession No. 1C9GA). This information 

30 can be utilized to determine whether an amino acid change alters the folding 
of the catalytic domain of the Gene 216 protein. In particular, the sequence of 
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the catalytic domain of Gene 216 protein can be plotted as X-ray 
crystallographic coordinates and used to determine changes in the tertiary 
structure of this domain. 

3. Biological Role of Gene 216 : ADAMs are part of a very large 
5 superfamily called zinc-dependent metalloproteases (Stone et. a!., 1999, J. 
Prot Chem. 18:447-465). Gene 216 represents a novel member of the ADAM 
family that is closely related to ADAM 19, a gene that was found to participate 
in the proteolytic processing of the membrane anchored protein neuregulin 1 
(NRG1) (Shirakabe et. al., 2001, J. Biol. Chem. 276(1 2):9352-8). The 

1 0 expression and activation of ADAM 1 9 protein has been localized to the trans- 
Golgi apparatus. This has been observed for other ADAM proteins (Lum et al., 
1998, J. Biol. Chem. 273:26236-26247; Roghani et. al., 1999, J. Biol. Chem. 
274:3531-3540; Shirakabe et. al., 2001, J. Biol. Chem. 276(1 2):9352-8). 
These data suggest that the ADAM genes, and Gene 216, encode proteins 

15 that function in the trans-Golgi apparatus as intracellular processing enzymes. 
The processed substrates of these enzymes may be released into the cytosol 
as part of a signal transduction cascade leading to the cell surface. 

The substrate of ADAM 19, NRG1 , belongs to a group of growth factors 
(neuregulins) that are members of the epidermal growth factor family. The 

20 neuregulins participate in an array of biological effects that are mediated by the 
epidermal growth factor family of tyrosine kinase receptors. Data suggest that 
the proteolytically cleaved isoform of NRG1 , NRG-01 , may induce the tyrosine 
phosphorylation of EGFR2 and EGFR3 in differentiated muscle cells 
(Shirakabe et. al., 2001, J. Biol. Chem. 276(1 2):9352-8). The sequence 

25 similarity of Gene 216 protein and ADAM 19 protein suggests that the 
neuregulins or their isoforms serve as substrates for Gene 216 protein. The 
Gene 216-processed neuregulins or isoforms may then serve as ligands for 
EGFR1. 

Epidermal growth factor receptor (EGFR1) plays a pivotal role in the 
30 maintenance and repair of epithelial tissue. Following injury in bronchial 
epithelium, EGFR1 is upregulated in response to ligands acting on it or through 
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transactivation of the EGFR1 receptor. This results in the increased 
proliferation of cells and airway remodeling at the point of insult, leading to the 
repair of the bronchial epithelium (Polosa et. a!., 1999, Am. J. Respir. Cell Moi. 
Biol. 20:914-923; Holgate et. al., 1999, Clin. Exp. Allergy Suppl 2:90-95). 
5 In asthma, the bronchial epithelium is highly abnormal, with structural 

changes involving separation of columnar cells from their basal attachments 
and functional changes that include increased expression and release of 
proinflammatory cytokines, growth factors, and mediator-generating enzymes. 
Beneath this damaged structure are the subepithelial myofibroblasts that have 

10 been activated to proliferate. This, in turn, causes excessive matrix deposition 
leading to abnormal thickening and increased density of the subepithelial 
basement membrane. 

Immunocytochemical studies have shown that both TGF- p and EGFR1 
are highly expressed at the area of injury and that parallel pathways could be 

15 operating in the repairing epithelial cells (Puddicombe et. al., 2000, FASEB J. 
14:1362-1374). EGFR1 stimulates epithelial repair and TGF- p regulates the 
production of profibrogenic growth factors and proinflammatory cytokines 
leading to extracellular matrix synthesis. As EGFR1 is involved in regulating 
a number of different stages of epithelial repair (survival, migration, proliferation 

20 and differentiation), any inhibitory effects that act on the receptor may cause 
the epithelium to be held in a "state of repair" (Holgate et. al., 1999, Clin. Exp. 
Allergy Suppl 2:90-95). 

Without wishing to be bound by theory, it is possible that a variant Gene 
216 protein induces the epithelium into a continuous "state of repair" by 

25 functioning improperly and failing to release its substrate (a member of the 
neuregulin family) that serves as the ligand for EGFR1. This, in turn, may 
cause the observed increase in EGFR1 expression. Under these 
circumstances, the TGF- p pathway remains active, producing a continuous 
source of proinflammatory products as well as growth factors that drive airway 

30 wall remodeling causing bronchial hyperresponsiveness, a phenotype of 
asthma. 
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It is also possible that the disintegrin-like domain of Gene 216 plays a 
role in respiratory diseases. Jntegrins are a family of heterodimeric 
transmembrane receptors that mediate cell-cell and cell-extracellular matrix 
interaction (Hynes, 1992, Ce//69:1 1). Integrins mediate angiogenesis (Brooks 
5 et al., 1994, Science 264:569), which plays a major role in various pathological 
mechanisms, such as tumor growth, metastasis, diabetic retinopathy, and 
certain inflammation diseases (Folkman, 1995, N. Engl. J. Med. 333:1757). 
Disintegrins act as integrin ligands that disrupt cell-matrix interactions (CP. 
Blobel and J.M. White, 1992, Curr. Opin. Cell Biol. 4:760-5) and inhibit 

10 angiogenesis (C.H. Yeh et al., 1998, Blood 92:3268-3276). Without wishing 
to be bound by theory, it is possible that the disintegrin-like domain of the Gene 
216 polypeptide inhibits angiogenesis in the respiratory system. Gene 216 
variants that have partly functional or non-functional disintegrin activity may 
lack anti-angiogenesis function. These Gene 216 variants may give rise to 

15 angiogenesis and inflammation in the respiratory system, a phenotype of 
asthma. 

EXAMPLE 9: Identification of the Mouse Homoloq for Gene 216 

The mouse ortholog of Gene 216 was identified by TBLASTN analysis 
of Gene 216 against mouse dbEST. BLAST analysis identified three mouse 

20 ESTs that were partially homologous to the human sequence but were not 
100% homologous to any known mouse ADAM genes. The three mouse ESTs 
were 100% identical to a partially sequenced mouse BAC (BAC389B9; 
Accession Number AF1 55960). This BAC maps to mouse chromosome 2 in 
a region that is syntenic to human chromosome 20p13. The 47 kb BAC 

25 sequence was analyzed for potential genes using the Genscan gene prediction 
program (Burge and Karlin, J. MoL Biol., 268:78-94). Additional putative exons 
were identified based on comparison of the human Gene 216 protein to the 
mouse BAC by TBLASTN. The results identified a mouse gene that contained 
an ORF of 2124 bp encoding a protein of 707 amino acids. The genomic 

30 nucleotide sequence of the mouse homolog is depicted in Figure 21 and the 
corresponding amino acid sequence is depicted in Figure 22. The mouse 
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amino acid sequence was analyzed by BLASTP analysis and found to have 
homology to mouse and human ADAM proteins. The mouse amino acid 
sequence was aligned against the amino acid sequence of human Gene 216 
(BestFit, http://www.gcg.com) (Figure 23). The results showed that the mouse 
5 and human proteins shared -70% identity at the amino acid level. This 
indicated that the mouse sequence was the murine orthblog of human Gene 
216. 

EXAMPLE 10: Polymorphism Identification 

Polymorphisms were identified in the chromosome 20 region and 

10 subsequently used in association studies. Most of the data focused on the 
region of Gene 216. 

1 . Single Nucleotide Polymorphism (SNP) Discovery : An efficient 
tiered approach was used for mutation analysis. First, PCR assays were 
developed across exons to include the consensus splice sites. Assays were 

15 designed for all exons that contribute to the open reading frame of the gene. 
This strategy ensured the detection of mutations that would result in the 
modification of the protein sequence as well as mutations that would be 
predicted to disrupt mRNA splicing. The identified promoter and putative 
regulatory element for Gene 216 and a large intronic region were assayed for 

20 polymorphisms as well. Second, a total of 77 individuals were tested for 
polymorphisms using fluorescent SSCP (single strand conformational 
polymorphism). This sample size provided a 99% power to detect a 
polymorphism with a frequency of 3% or greater. Briefly, PCR was used to 
generate templates from asthmatic individuals that showed increased sharing 

25 for the 20p13-p12 chromosomal region and contributed towards linkage. Non- 
asthmatic individuals were used as controls. Enzymatic amplification of Gene 
216 was accomplished using PCR with oligonucleotides flanking each exon as 
well as the putative 5* region. Primers were chosen to amplify each exon as 
well as 1 5 or more base pairs within each intron on either side of the splice 

30 site. The forward and the reverse primers were labeled with two different dye 
colors to allow analysis of each strand and confirm variants independently. 
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Standard PCR assays were utilized for each exon primer pair following 
optimization. Buffer and cycling conditions were specific to each primer set. 

The products were denatured using a formamide dye and electrophoresed on 
non-denaturing acrylamide gels with varying concentrations of glycerol (at least 
two different glycerol concentrations). 

Primers utilized in fluorescent SSCP experiments to screen coding and 
non-coding regions of Gene 216 for polymorphisms are provided in Table 8. 

Column 1 lists the genes targeted for mutation analysis. Column 2 lists the 
specific exons analyzed. Column 3 lists the primer names. Columns 4 and 5 
list the forward primer sequences and corresponding SEQ ID NOS, 
respectively. Columns 5 and 6 list the reverse primer sequences and 
corresponding SEQ ID NOS, respectively 

TABLE 8 



Gene 


Exon 


Assay Name 


Primer Sequence 


SEQ ID 

NO: 


Primer Sequence - 


SEQ ID 
NO: 


216 


216_A 


502_2 1 6_A_F_503_2 1 6_A_R 


Ctgcctagaggccgagga 


63 


agctctgagcagaacccatc 


106 


216 


216_A 


1 623J2 1 6_A_F_1 624_2 1 6_A_R 


Caggagaccacggaagatcg 


64 


ctcgagggggtggagctg 


107 


216 


216_A 


1 625_2 1 6_A_F_1 626_2 1 6_A_R 


Ttgcctgaaccttcctatcc 


65 


gagaggaggagagaaccgct 


108 


216 


216_B 


293_2 1 6_B_F_294_21 6_B_R 


Cccctgtgttcctcaggtc 


66 


agtgacttggtggttctggg 


109 


216 


216_C 


295_2 1 6_C_F_296_2 1 6_C_R 


Gctccacactctttcttgcc 


67 


tgtcatctgcaccctctctg 


110 


216 


216_D 


297J2 1 6„D_F_298_2 1 6 J)_R 


Aggcaggaggaagctgaat 


68 


aagagggagggtgtggtagg 


111 


216 


216_E 


1290_2 1 6_E_F_1 291_2 1 6_E_R 




69 


gtgatcaggccactagggtg 


112 


216 


216_F 


2992 1 6_F_F„300_2 1 6_F_R 


Cctacccctctgcacccta 


70 


atacagcattcccactccca 


113 


216 


216_G 


30 l_2 1 6_G_F_302_2 1 6_G_R 


aacttccttctgggagctgg 


71 


gaaggcagaaatcccggt 


114 


216 


216_H 


700 _2 1 6_H_F_70 1_2 1 6_H_R 


cacaccctggtgaggagaga 


72 


caccagcacctgcctgtc 


115 


216 


216_I 


305_2 1 6JLF_306_2 1 6J__R 


ccacgaaggaccaccg 


73 


gggtcagaggcacccac 


116 


216 


216J 


889_2 1 6_J_F_890_2 1 6_J_R 


ctcacgtgggtgcctctg 


74 


gccgtagagcctcctgtct 


117 


216 


216JC 


89 1_21 6_K_F_892_2 16_K_R 


ctctacggccgcagtgac 


75 




118 


216 


216_L 


3 1 1_216_L_F_312_216JL_R 




76 


tgagcggagagggcaagt 


119 


216 


216JL 


3 13_2 1 6_L_F_3 1 4_2 1 6 L_R 


caggttaagtcggctcgc 


77 


aaaccctcaccctgaacctt 


120 


216 


216„M 


3 1 5_2 1 6_M_F_3 1 6_2 1 6_M_R 


ctctctctgccttccccac 


78 


aagggtgctcgtgtcctct 


121 


216 


216_N 


3 1 7_2 1 6_N_F_3 1 8_216_N_R 


tctactgtggggaagatggg 


79 


ccactcagctccactcccta 


122 


216 


216_0 


319_21 6_O_F_320_21 6_0_R 


cccctctacttcctcccca 


80 


ggattcaaacggcaaggag 


123 


216 


216_P 


321_216_P_F_322_216_P_R 


gaccttggggttcctaatcc 


81 


gctgagtcctgagcaggtg 


124 


216 


2I6_Q 


323_2 1 6_Q_F_504_2 16_Q_R 


gtgcacctgctcaggactc 


82 


gaaccgcaggagtaggctc 


125 


216 


216_R 


325_2 16_R_F_326_216_R_R 


cctggactcttatcacgttgc 


83 


atatggtcagcaggagaccc 


126 


216 


216_S 


327_2 1 6_S_F_328_216_S_R 


ttaccctccaccatttctcc ' 


84 


gcatcctggtctccatgataa 


127 


216 


216_S 


1 308_2 1 6_S_F_1309_216_S_R 


gtggagagggaagggagaag 


85 


gaggctttgaatccaggtcc 


128 


216 


216_T 


1 294_2 1 6 JT_F_1295_2 1 6_T_R 


ccccatgggttgaatttaca 


86 


cagcaagacaccgcatctac 


129 


216 


216_T 


1 296_21 6JTJF_1297_2 1 6_T_R 


gcagctaggcctacaggtaca 


87 


gggacagagggaaccattta 


130 


216 


216_T 


1298_2 1 6JTJM299_2 16_T_R 


accacgcctatagccaacat 


88 


ttccttcctgtttcttccca 


131 
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216 


216_T 


1 300_2 1 6_T_F_1 30 1_2 1 6_T_R 


aggtgtagcactgggattgg 


89 


gtcctgggagtctggtgtgt 


132 


216 


216_T 


1 302_2 1 6_T_FJ 303„2 1 6_T_R 


ccccaggaccactagcttct 


90 


aggaacccagagccacacta 


133 


216 


216_T 


1 304_2 1 6_T_F_ 1 3 05_2 1 6JT.R 


attgagctggagagtgtgcc 


91 


tgcctctggtgagaggtagc 


134 


216 


216_T 


1 306_2 1 6„T_F_ 1 307_2 1 6_T_R 


ttcaagttcctggagtggct 


92 


ttcctggatcactggtcctc 


135 


216 


216_AA 


1 6 1 9_2 1 6_AA_F_ 1 620_2 1 6_AA_R 


acaaggaccctctaaacgca 


93 


ttcgagcagtgagagaaacct 


136 


216 


216_PQ 


1465J2 1 6_PQ_F_1466_2 1 6_PQ_R 


acccttctgtgacaagccag 


94 




137 


216 


216_QR 


1467_2 16_QR_F„1468_21 6_QR_R 


gtgttgctaccgactcccag 


95 


aggccactggaacctcct 


138 


216 


216_QR 


1469_2 1 6__QRJFJ470„2 1 6_QR_R 


cccaggtgcagagagcag 


96 


gcagcatggtacagggactg 


139 


216 


216_QR 


147 1_2 1 6_QR_F_1472_2 1 6_QR_R 


gctcctcttgtccactctcct 


97 


cagctgaccagtggtatgga 


140 


216 


216_QR 


1 473_2 1 6__QR_F_1 474_2 1 6_QR_R 


gccacttcctctgcacaaat 


98 


tgtcagacatggccac agag 


141 


216 


216_QR 


1 475„2 1 6_QR_F_ 1 476_2 1 6_QR_R 


ttctclgtgacctgggtggt 


99 


agggtcctcttagctgccac 


142 


216 


216_QR 


1 477_2 1 6_QR_F_1478_2 1 6_QR„R 


atttgggccagagatggg 


100 


aggccttgtcatttcctgtg 


143 


216 


216_QR 


1 479J2 1 6J}R_F_1480_2 1 6_QR_R 




101 


caaagaaccttggatgtccg 


144 


216 


2l6_QR 


1 48 1 _2 1 6_QR_F_ 1 482_2 1 6_QR_R 


atggcttggaatcatcaagg 


102 


ctcagctcccttcctgctc 


145 


216 


216„QR 


1483_2 16j3RJFJ484_2 1 6_QR_R 


tagagagaggaggtgccagc 


103 


ctgtgtgggcc atctttg 


146 


216 


216_RS 


1485_216„RSJM486_216JRS_R 


aaagatggcccacacagg 


104 


ggagaaatggtggagggtaa 


147 


216 


216_ST 


1487_2 1 6_STJFJ488_2 1 6_ST_R 


agaactctcatgagcccagc 


105 


aaagccacagcttctccct 


148 


216 


216_ST 


1 489_2 1 6_STJM 490_2 1 6_ST_R 


aggtttctgggctcaggtta 


149 


caggatcttggcatctggac 


153 


216 


216_UP 


1 463_2 1 6_UP_F_1464_2 1 6_UP_R 


gtaggtgtgccagagcagg 


150 


ctggcttgtcacagaagggt 


154 


216 


216_U 


1 292_2 1 6_U_F J 293_2 1 6_U_R 


tgtggacctagaatggtgagc 


151 


ctggagcacagtggcagtta 


155 


216 


216_V 


1 736_2 1 6_V_F_1 737_2 1 6_V_R 


caaagtcacacaacaagcgg 


152 


tttggtcgtccctcagtttc 


156 



Once polymorphisms were identified, multiple individuals representative 
of each SSCP pattern and two genomic controls were sequenced for 
polymorphism validation and to identify SNPs. The variants detected in the 
5 initial set of asthmatic and normal individuals were subject to fluorescent 
sequencing (ABI) using a standard protocol described by the manufacturer 
(Perkin Elmer). In cases where SSCP did not identify polymorphisms in Gene 
216, sequence information was obtained from 16 individuals that were identical 
by descent (I BD) in the region, and from 4 controls to ensure that potential 

10 polymorphisms were identified, 

Primers utilized in DNA sequencing for purposes of confirming 
polymorphisms detected using fluorescent SSCP are provided in Table 9. 
Column 1 lists the specific exons sequenced. Column 2 lists the forward 
primer names, column 3 lists the forward primer sequences, and column 4 lists 

15 the corresponding SEQ ID NOS. Column 5 lists the reverse primer names, 
column 6 lists the reverse primer sequences, and column 7 lists the 
corresponding SEQ ID NOS. 
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TABLE 9 



Exon 


Forward 


Forward Seq 


SEQ ID 

NO: 


Reverse Name 


Reverse Seq 


SEQ 
ID NO: 


216 A 


MDSeq 101 216 A„F 


cctctcaggaglagaggccc 


J57 


MDSeq 101 216 A R 


ccaagcacacttgagcgtc 


177 


216 A 


MDSeq 175 216 A F 


agcggttctctcctcctetc 


158 


MDSeq 175 216 A R 


agccatgccctctgcttt 


178 


216 A 

216 A 


MDSeq 213 216 A F 
MDSeq 334 216 A F 


cctctcaggagtagaggccc 
atgtiactgaggccgaaagg 


159 
160 


MDSeq 213 216 A R 
MDSeq 334 216 A R 


cagcccaagcacacttga 
cccatagctgtgagctcctc 


179 
180 


216 B 
216 C 


MDSeq 296 216 B F 
MDSeq 297 216 C F 


ccctttccagccttctcttt 
caggactgcaaacatcctga 


161 
162 


MDSeq 296 216 B R 
MDSeq 297 216 C R 


a tc ttggtccctgccattc 


1S1 
182 


216 D 


MDSeq_61 216 D F 




163 


MDSeq 61 216 D R 




183 


216 E 


MDSeq 245 216 E F 


aggcaggaggaagctgaat 


164 


MDSeq 245 216 B R 


ggaccaccaggaaggctg 


184 


216 F 


MDSeq 57 216 F F 


cctcttgcccctcttgct 


165 


MDSeq_57 216 F R 


aaccccagctcccagaag 


185 


216 G 


MDSeq 336 216 G F 


cctgaatgtecagagtcctga 


166 


MDSeq 336 216 G R 


ctgolcacctggaaaggaae 


186 


216 H 


MDSeq 155 216 H F 


ggcctc gagtcccagtattt 


167 


MDSeq 155 216 H R 


actgcaggaaggcccagag 


187 


216 I 


MDSeq 363 216 1 F 




168 


MDSeq 363 216 I R 


accgaaacttgaaccacacc 


188 


216 J 


MDSeq 181 216 J F 


tcgccctcagcttctcag 


169 


MDSeq 181 216 J R 


tgag ggacg accaaagaaac 


189 


216 K 


MDSeq 182 216 K F 


tcacgtgggtgcctctga 


170 


MDSeq 182 216 K R 


caaagtcacacaacaagcgg 


190 


216 L 


MDSeq 106 216 L F 




171 


MDSeq 106 216 L R 


gaacc tgagg gcaccaa tta 


191 


216 M 


MDSeq 337 216 M F 


ctgggctttccaccctgg 


172 


MDSeq 337 216 M R 


ttg gccttagttaattggtgc 


192 


216 N 


MDSeq 338 216 N F 


ctgggctttccaccctgg 


173 


MDSeq 338 216 N R 


ttggccttagttaattggtgc 


193 


216 O 


MDSeq 49 216 O F 


tccaggtggtgaactctgc 


174 


MDSeo_49 216 O R 


ctggagcacagtggcagtta 


194 


216 P 


MDSeq 248 216 P F 


tagaatggtgagctctgccc 


175 


MDSeq 248 216 P R 


aggagtaggctcaggaagca 


195 


216 Q 


MDSeq 96 216 Q F 


gaccttggggttcctaatcc 


176 


MDSeq 96 216 Q R 


tgtactgggaggtagagggc 


196 


216 R 


MDSeq 50 216 R F 


agagggtgactlggagcaga 


197 


MDSeq 50 216 R R 


ccagaaacctgattaggggg 


219 


216 S 


MDSeq 262 216 S F 


aggcaataacccactcagga 


198 


MDSeq 262 216 S R 


tacctctcaccagaggcagg 


220 


216 T 


MDSeq 255 216 T F 


cccatgggttgaatttacata 


199 


MDSeq 255 216 T R 


gccagaagctagtggtcctg 


221 


216 T 


MDSeq 256.216 T F 


gcctctggtgatcctcctac 


200 


MDSeq 256 216 T R 


gcaggcagcttggaagttt 


222 


216 T 
216 T 


MDSeq 257 216 T F 
MDSeq 258 216 T F 


actcagtcgaaccatagggc 


201 
202 


MDSeq 257 216 T R 
MDSeq 258 216 T R 


ttatcatggagaccaggatgc 
gacctggattcaaagcctcc 


223 
224 


216 T 
216 T 


MDSeq 358 216 T F 
MDSeq 365 216 T F 


actcagtcgaaccatagggc 


203 
204 


MDSeq 358 216 T R 
MDSeq 365 216 T R 


atgttggctataggcgtggt 
ttatcatggagaccaggatgc 


225 
226 


216 U 


MDSeq 244 216 U F 


gc aggaa ggtgtcatg gtct 


205 


MDSeq 244 216 U R 


ctgagtggagggagcagaag 


227 


216 U 


MDSeq 292 216 U F 


gcaggaaggtgtcatggtct 


206 


MDSeq 292 216 U R 


ctgagtggagggagcagaag 


228 


216 V 


MDSeq 389 216 V F 


gggcattggagaggcaag 


207 


MDSeq 389 216 V R 


ccatgagatcggccacag 


229 


216 AA 


MDSeq 360 216 AA F 




208 


MDSeq 360 216 AA R 


atttcaaggctgcaatgagg 


230 


216 PQ 


MDSeq 300 216 PQ F 


agaatgccttccaggagctt 


209 


MDSeq 300 216 PQ R 


acttctttccatR gcctc tg 


231 


216 QR 


MDSeq 301 216 QR F 


gtgttgctaccgactcccag 


210 


MDSeq_301 216 QR R 


accacccaggtcacagagaa 


232 


216 QR 


MDSeq 303 216 QR F 




211 


MDSeq 303 216 QR R 


tcccaagaccaggctatgtc 


233 


216 QR 


MDSeq 32t 216 QR F 


aacaggaggttccagtggc 


212 


MDSeq 321 216 QR R 


ctggggatgagaagcagc 


234 


216 QR 


MDSeq 322 2 16 QR F 


agcgagftgtgattgagggt 


213 


MDSeq_322 216 QR R 


cttctcccitccctctccac 


235 


216 QR 


MDSeq 361 216 QR F 


tgtgcaggctgaaagtatgc 


214 


MDSeq 361 216 QR R 


atttgtgcagaggaagtggc 


236 


216 QR 


MDSeq 362 216 QR F 


gccacttcctclgcacaaat 


215 


MDSeq 362 216 QR R 


catttcctccaggctctgac 


237 


216 RS 


MDSeq 339 216 RS F 


ctga gcccagaaacctg att 


216 


MDSeq 339 216 RS R 


tcagagcctggaggaaatgt 


238 


216 ST 


MDSeq 302 216 ST F 


gtgagtgaggcaccaggg 


217 


MDSeq 302 216 ST R 


gttcctggagtgggtgggt 


239 


216 UP 


MDSeq 359 216 UP F 


cctagatggccaggaagtga 


218 


MDSeq 359 216 UP R 


ctgggagtcggtagcaaca 


240 



Single nucleotide polymorphisms (SNPs) that were identified in Gene 
216 are provided in Table 10. Column 1 lists the SNP numbers (1-48). 
Column 2 lists the exons that either contain the SNPs or are flanked by intronic 
sequences that contain the SNPs. Column 3 lists the PMP sites for the SNPs. 

A "-" denotes polymorphisms which are 5' of the exon that are within the 
intronic region. The corresponding number is given from the 3 f to 5' direction. 

A "+" denotes polymorphisms which are 3' of the exon that are within the 
intronic region. The number corresponding to the "+" is given from the 5' to 3' 
direction. Columns 2 and 3, combined, show the SNP names as described 
herein, e.g., T+1 , T+2, etc. Column 4 indicates whether the SNP was detected 
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in an exon or intrpn sequence. Column 5 lists the SNP locations in the Gene 
216 genomic sequence of SEQ ID NO:6 (Figure 7). Column 6 lists the SNP 
reference sequences which illustrate the SNP nucleotide changes with 
underlining. Column 7 lists the SEQ ID NOs of the SNP reference sequences. 
5 Column 8 lists the base changes of the SNP sequences. Column 9 lists the 
amino acid changes resulting from the SNP sequences. 



TABLE 10 



SNP 


Exon 


PMP 
site 


Locatio 


Locatio 
n 


Sequence (20nt+SNP+20nt) 


SEQ ID 
NO: 


PMP 


AA 

Change 


1 


A 


-1 


ntron 


4653 


GCCCTCTGAGACCGACGGGGAGGGACGGCTCGGGCCGGTC 
A 


241 


A>T 




2 


A 


-2 


ntron 


4610 


CAAG A ACCTTCCC AG CG GTT£TCTCCTCCTCTCAGG AGTAG 


242 


OA 




3 


c 


-1 


ntron 


9827 


CACCATCTCAGCTCCACACTCTTTCTTGCCCAGGTCTCGAA 


243 


OT 




4 


c 


-2 


ntron 


9826 


CCACC ATCTCAGCTCCACACTCTTTCTTG CCC AG GTCTCG A 


244 


T>A 




5 


D 


-1 


intron 


11687 




245 


G>C 














6 


D 


-2 


intron 


11661 


TGGTGCTTCCCATATTCACATCTCCCACAACTAAGCCATCA 


246 


T>C 




7 


D 


1 




11912 


CAGGATACATAGAAACCCACJACGGCCCAGATGGGCAGCC 
A 


247 


T>C 


TyioHis 


3 


F 


4-1 




12545 


CCCTCCAAATCAGAAGAGACAGGAATTCACAGGCCTCGAG 
T 


248 


A>G 




9 


F 


1 




12411 


AGCTGCTCACCTGGAAAGGAACCTGTGGCCACAGGGATCC 
T 


249 


A>G 


Thr>AIa 


10 










ACTTCCTTCTG G G AG CTGG G G.TTG q qq q TC AGG G CTC AAG C 


250 


G>A 




— 











TTCCTGCAGTGGCGCCGGGGG.CTGTGGGCGCAGCGGCCCC 
A 


251 


G>A 


none 












G 




G>T 














CAGAGAAGCGCGGGGGTTGGGGGACTGTCCCTCCATGCCC 

A 


253 


G>A 




14 


L 


-2 


intron 


13988 


CCCCTCTCTGGGCTCTGCGCGTCTGGCGGCTGTAGCCAAGC 


254 


G>A 




15 


L 






14135 


CAGCCGCCGCCAGCTGCGCGCCTTCTTCCGCAAGGGGGGC 
G 


255 


OT 


Ala>VaI 


16 


Q 


+1 


intron 


I6I58 


AGTGGCCTCCCAGTCAAGCGAGGGGGTGGATCCCTGCCCC 
A 


256 


A>T 




17 


Q 


1 




15865 


TGCTGGCCATGCTCCTCAGCQTCCTGCTGCCTCTGCTCCCA 


257 


G>A 


Val>Ile 


18 


Q 


2 




15S8S 


CTGCTGCCTCTGCTCCCAGG2GCCGGCCTGGCCTGGTGTTG 


258 


G>C 




19 


QR 


+1 


intron 


16133 


GAAGTAGCTTTGAACAGGAG2TTCCAGTGGCCTCCCAGTCA 


259 


G>T 




20 


QR 


+3 


intron 


16361 


GCCTCTGTCTCACCAGTTTT£GGCCCTTTGCCACTTCCTCT 


260 


OT 




21 


QR 


+4 


intron 


16404 


ACAAATCACCTCTGTCACCCCCTTGAAGTTCCCAAATGCTG 


261 


OA 




22 


QR 


+5 


intron 


16465 


TCCATACCACTGGTCAGCTG£GGTGCTGGCTGCCCCTGTGC 


262 


OT 




23 


QR 


+6 


intron 


16486 


GGTGCTGGCTGCCCCTGTGCCAGGGCCCTGCCTTAACCCAG 


263 


OT 




24 


QR 


+7 


intron 


16936 


GGAAATGACAAGGCCTTGGGaGATGGGATGGGGACAGTCA 
A 


264 


G>A 




25 


R 


+1 


intron 


17510 


AGGGCTCATGCCTCCTGCCTCCTTCCAGATGGGCAGCACCC 


265 


OT 




26 


R 


+2 


intron 


17571 




266 


T>G 
















27 


R 


1 




17403 


CCTGGGCGGCGTTCACCCCAIGGAGTTGGGCCCCACAGCC 
A 


267 


T>C 


Met>Thr 


28 


R 


2 




17432 


GCCCCACAGCCACTGGACAGCCCTGGCCCCTGGGTGAGTG 
A 


268 


OT 


Pro>Ser 


29 


RS 


-1 


intron 


17451 


GCCCTGGCCCCTGGGTGAGTGAGGCACCAGGGGGAGGTGG 
A 


269 


G>T 
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30 


T 


+1 


intron 


17958 


TGCAGCCTGGGGCCCCAGTCCTTAGGGGACAACATATCCTC 


270 


OA 




31 


T 


+2 


intran 


17924 


CACTGAGTGAGGATGGGCTCJCTGCCACACAGCTTGCAGCC 


271 


T>C 




32 


T 


+3 


intron 


17916 


CTGGTCCTCACTGAGTGAGGATGGGCTCTCTGCCACACAGC 


272 


A>G 




33 


T 


+4 


intran 


17834 


ATGACCTCTTGGTTATCATGQAGACCAGGATGCTGGAAGCC 


273 


G>C 




34 


T 


1 


3' UTR 


18833 


AGCAAGACACCGCATCTACAGAAAAATTTTAAAATTAGCTG 


274 


G>A 




35 


T 


2 




18787 


GGAGGATCACCAGAGGCCAGCAGGTCCACACCAGCCTGGG 
C 


275 


OG 




36 


T 


3 


3 1 UTR 


18760 


ATCCCAGCACTTTGGGAAGCQGGGGTAGGAGGATCACCAG 
A 


276 


OT 




37 


T 


4 


3' UTR 


18497 


AGCCTGGCTGGCCTCTGCAAACAAACATAATTTTGGGGACC 


277 


<\>G 




38 


T 


5 




18476 


ACTGAGTCCACACTCCCCTGCAGCCTGGCTGGCCTCTGCAA 


278 


OG 




39 


T 


6 




18206 


TCCAGGAACCCAGAGCCACAJTAGAAGTTCCTGAGGGCTG 
G 


279 


r>c 




40 


T 


7 




18174 


TTCTTCCCCGAGTGGAQCTTQGACCCACCCACTCCAGGAAC 


280 


OT 




41 


T 


8 




17997 


TCCTCATTCTCAGCAGATCAAGTCCAGATGCCAAGATCCTG 


281 


A>T 


Glfl>His 


42 


T 


-2 


intron 


19094 


C TG AG G ACC ACACG GG G TG GTG GTTG G CG G G GTG G TG GTT 
G 


282 






43 


T 


-4 


intron 


19160 


GGCTGGCAGGCCGAGCCTAGATGGCAGCCAGAGCCCCAGG 
C 


283 


\>G 




44 


T 


-5 


intron 


19244 


CTTTGCTCTGTCACTCCTGCC.TCCCTTGGGCGTTCACATTC 


284 


Z>T 




45 


U 


-1 


intron 


15423 


GTGAGCTCTGCCCACCCGACC.CCTCCTTGCCGTTTGAATCC 


285 


OT 




46 


V 


+i 


intron 


13859 


TGGCGAGGTTACTCCTACACCGGGAGGAGCACCGTCGGGT 
C 


286 


OT 




47 


V 


+2 


intron 


13921 


GGCTGCTCACTATTGGGGCCGCATCGTCCCCTGTCCCGCTT 


287 


3>T 




48 


V 


+3 


Intron 


13938 




288 


3>A 





Using an in-house program called snp_yiew; the genomic structure of 
the gene is diagrammatically shown in Figure 11. The exons are shown to 
scale and the SNPs are identified by their location along the genomic BAC 
5 DNA. The polymorphic sites identified in the Gene 21 6 genomic sequence are 
also shown by the underlined nucleotides in Figure 29. The polymorphic sites 
discovered within the cDNA and the corresponding amino acid position in Gene 
216 are underlined in Figure 24. It will be understood by those of skill in the art 
that the SNPs identified in the Gene 21 6 genomic sequence can be correlated 
10 to the SNP positions identified in the Gene 216 cDNA sequence by aligning the 
genomic and cDNA sequences. 
EXAMPLE 11: Polymorphism Genotyping 

Once putative variants were confirmed by sequencing, rapid allele 
specific assays were designed to type more than 400 individuals (> 200 cases 
15 and > 200 controls) for use in the association studies. All coding SNPs 
(cSNPs) that resulted in an amino acid change were typed. Neutral 
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polymorphisms were typed if: 1) the polymorphism was present in an exon 
lacking a cSNP that resulted in an amino acid change; 2) the polymorphism 
was present in an exon containing a cSNP resulting in an amino acid change 
but the two polymorphisms were observed to have different frequencies; and 
5 3) the polymorphism was in an intronic region adjacent to an exon without a 
cSNP. If results from the association studies appeared positive, additional 
neutral polymorphisms were typed. More than 30 allele specific assays from 
Gene 216 were typed for the case control population (Table 11). 

Two types of allele specific assays (ASAs) were used. If the SNP 
10 resulted in a mutation that created or abolished a restriction site, restriction 
fragment length polymorphisms (RFLPs) were obtained from PCR products 
that spanned the variants, and the RFLPs were analyzed. If the 
polymorphisms did not result in RFLPs, allele specific oligonucleotide assays 
were used. For these assays, PCR products that spanned the polymorphism 
1 5 were electrophoresed on agarose gels and transferred to nylon membranes by 
Southern blotting. Oligomers 16-20 bp in length were designed such that the 
middle base was specific for each variant. The oligomers were labeled and 
successively hybridized to the membrane in order to determine genotypes. 
The specific method used to type each SNP is indicated in Table 11. 
20 Table 1 1 below contains the information relating to the specific assay 

used. Column 1 lists the SNP designation number. Column 2 lists the specific 
assay used, either RFLP or ASO. Column 3 lists the enzyme used in the RFLP 
assay (described below). Columns 4 and 6 list the sequence of the primers 
used in the ASO assay (described below). Columns 5 and 7 list the 
25 corresponding SEQ ID NOS for the primers. 

1 . RFLP Assay : The amplicon contain! ricfthe polymorphism was 
PCR amplified using primers that were used to generate a fragment for 
sequencing (sequencing primers) or SSCP (SSCP primers). The appropriate 
population of individuals was PCR amplified in 96 well microtitre plates. 
30 Enzymes were purchased from NEB. The restriction cocktail containing 

the appropriate enzyme for the particular polymorphism is added to the PCR 
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product. The reaction was incubated at the appropriate temperature according 
to the manufacturer's recommendations (NEB) for 2-3 hr, followed by a 4°C 
incubation. After digestion, the reactions were size fractionated using the 
appropriate agarose gel depending on the assay specifications (2.5%, 3%, or 
5 Metaphor, FMC Byproducts). Gels were electrophoresed in 1 X TBE Buffer 
at 170 Volts for approximately 2 hr. The gel was illuminated using ultraviolet 
light and the image was saved as a Kodak 1D file. Using the Kodak 1D image 
analysis software, the images were scored and the data was exported to 
Microsoft EXCEL (http://www.microsoft.com). 

10 2. ASO assay : The amplicon containing the polymorphism was 

PCR amplified using primers that were used to generate a fragment for 
sequencing (sequencing primers) or SSCP (SSCP primers). The appropriate 
population of individuals was PCR amplified in 96 well microtitre plates and re- 
arrayed into 384 well microtitre plates using a Tecan Genesis RSP200. The 

15 amplified products were loaded onto 2% agarose gels and size fractionated at 
150V for 5 min. The DNA was transferred from the gel to Hybond N+ nylon 
membrane (Amersham-Pharmacia) using a Vacuum blotter (Bio-Rad). The 
filter containing the blotted PCR products was transferred to a dish containing 
300 ml pre-hybridization solution (5 X SSPE (pH 7.4), 2% SDS, 5 X 

20 Denhardt's). The filter was incubated in pre-hybridization solution at 40°C for 
over 1 hr. After pre-hybridization, 10 ml of the pre-hybridization solution and 
the filter were transferred to a washed glass bottle. The allele specific 
oligonucleotides (ASO) were designed with the polymorphism in the middle. 
The size of the oligonucleotide was dependent upon the GC content of the 

25 sequence around the polymorphism. Those ASOs that had a G or C 
polymorphism were designed so that the T m was between 54-56°C and those 
that had an A or T variance were designed so that the T m was between 60- 
64°C. All oligonucleotides were phosphate free at the 5' end and purchased 
from GibcoBRL. For each polymorphism, 2 ASOs were designed: one for 

30 each variant. 

The two ASOs that represented the polymorphism were resuspended 

-154- 



WO 01/78894 



PCT/US01/12245 



at a concentration of 1 ug/ul and separately end-labeled with y-ATP (6000 
Ci/mmol) (NEN) using T4 polynucleotide kinase according to manufacturer 
recommendations (NEB). The end-labeled products were removed from the 
unincorporated y-ATP 32 by passing the reactions through Sephadex G-25 
5 columns according to manufacturers recommendation (Amersham-Pharmacia). 
The entire end-labeled product of one ASO was added to the bottle containing 
the appropriate filter and 10 ml hybridization solution. The hybridization 
reaction was placed in a rotisserie oven (Hybaid) and left at 40°C for a 
minimum of 4 hr. The other ASO was stored at -20° C. 
10 After the prerequisite hybridization time had elapsed, the filter was 

removed from the bottle and transferred to 1 L of wash solution (0.1 X SSPE 
(pH 7.4), 0.1% SDS) pre-warmed to 45°C. After 15 min, the filter was 
transferred to another L of wash solution (0.1 X SSPE (pH 7.4), 0.1% SDS) 
pre-warmed to 50°C. After 15 min, the filter was wrapped in Saran, placed in 
15 an autoradiograph cassette and an X-ray film (Kodak) placed on top of the 
filter. Typically, an image would be observed on the film within 1 hr. After an 
image had been captured on film for the 50°C wash, the process was repeated 
for wash steps at 55°C, 60°C and 65°C. The image that captured the best 
result was used. 

20 The ASO was removed from the filter by adding 1 L of boiling strip 

solution (0.1 x SSPE (pH 7.4), 0.1% SDS). This was repeated two more 
times. After removing the ASO the filter was p re-hybridized in 300 ml pre- 
hybridization solution (5 X SSPE (pH 7.4), 2% SDS, 5 X Denhardt's) at 40°C 
for over 1 hr. The second end-labeled ASO corresponding to the other variant 

25 was removed from storage at -20°C and thawed at room temperature. The 
filter was placed into a glass bottle along with 10 ml hybridization solution and 
the entire end-labeled product of the second ASO. The hybridization reaction 
was placed in a rotisserie oven (Hybaid, http://www.hybaid.co.uk) and left at 
40°C for a minimum of 4 hr. After the hybridization, the filter was washed at 

30 various temperatures and images captured on film as described above. 

The two films that best captured the allele-specific assay with the two 
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ASOs were converted into digital images by scanning them into Adobe 
PhotoShop. These images were overlaid against each other in Graphic 



Converter and then scored. 

TABLE 11 



SNP 


ASA 
Type 


RFLP 
Enzyme 


ASO Primerl 


SEQID 
NO: 


ASO Primer2 


SEQID NO: 


1 


ASO 




jc c gtc c c ac c c cgtcg 


289 


gccgtccctccccgtcg 


299 


2 


ASO 




cctcctctcttggcgac 


290 


tcctcctctattggcgaccc 


300 


3 


ASO 




tccacactctttcttgcc 


291 


ctccacactttttcttgccca 


301 


4 


ASO 




gctccacactctttcttgcc 


292 


^ctccacactctttcttgc 


302 


5 


ASO 




tcaccaaggctccttcct 


293 


;caccaagcctccttcct 


303 


6 


Alt. Meth 










7 


RFLP 


Xcml 










8 


ASO 




cagaagagacaggaattcaca 


294 


agaagagacgggaattcac 


304 ! 


9 


ASO 




tggaaaggaacc tgtggcc 


295 


tggaaaggagcctgtgg 


305 


10 


ASO 












11 


ASO 




•>',;■; 








12 


ASO 




gggtttcggggagcttg 


296 


agggtttcgtggagcttgg 


306 


13 


ASO 




gggttgggggactgtc 


297 


ggggttggaggactgtcc 


307 


14 


ASO 




ctctgcgcgtctggcg 


298 


gctctgcgcatctggcgg 


308 


15 


RFLP 


BssHII 










16 


ASO 




agtcaagcgagggggtgg 


309 


agtcaagcgtgggggtgg 


322 


17 


ASO 




cctcagcgtcctgctg 


310 


ctcctcagcatcctgctgc 


323 


18 


RFLP 


KasI 










19 


ASO 




aacaggaggttccagtgg 


311 


gaacaggagtttccagtggc 


324 


20 


ASO 




accagttttcggcccttt 


312 


cacc agtttttggc cctttg 


325 


21 


ASO 




ctgtcacccccttgaagt 


313 


ctgtcacccacttgaagttc 


326 


22 


ASO 




tcagctgcggtgctgg 


314 


ggtcagctgtggtgctgg 


327 


11 

ZJ 


RFLP 


BstNI 










24 


ASO 




gccttgggggatgga 


315 


aggccttgggagatgggat 


328 


25 


ASO 




tcctgcctccttccag 


316 


tcctgccttcttccag 


329 


26 


RFLP 


Bgll 










27 


RFLP 


Ncol 










28 


ASO 




actggacagccctggc 


317 


actggacagtcctggc 


330 


29 


ASO 












30 


RFLP 


Bsu36I 










31 


ASO 




ctgtgtggcagagagccca 


318 


tgtggcagggagccca 


331 


32 


ASO 












33 


RFLP 


Bsal 










34 


Alt. Meth 










35 


RFLP 


Cac8I 










36 


RFLP 


Mspl 










37 


ASO 






319 


attatgtttgcttgcagagg 


332 


38 


RFLP 


Fnu4HI 




320 




333 


39 
40 


ASO 
RFLP 


TaqI 


gaacttctagtgtggctct 








41 


RFLP 


Main 










42 


ASO 












43 


RFLP 


Styl 
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44 


ASO 




ccaagggaggcaggagt 


321 




334 


45 


RFLP 


Hinfl 










46 


RPLP 


BsrI 










47 


RFLP 


Ecol09 I 








48 


ASO 


1 









EXAMPLE 12: Association Study Analysis 

1. Case-Control Study : In order to determine whether 
polymorphisms in candidate genes were associated with the asthma 
5 phenotype, association studies were performed using a case-control study 
design. In a well- matched design, the case-control approach is more powerful 
than the family based transmission disequilibrium test (TDT) (N.E. Morton and 
A. Collins, 1998, Proc. Natl. Acad. ScL USA 95:11389-93). Case-control 
studies are, however, sensitive to population heterogeneity. 

10 To avoid issues of population admixture, which can bias case-control 

studies, the unaffected controls were collected in both the US and the UK. A 
total of three hundred controls were collected, 200 in the UK and 100 in the 
US. Inclusion into the study required that the control individual was negative 
for asthma, as determined by self-report of never having asthma, had no first 

15 degree relatives with asthma, and was negative for eczema and symptoms 
indicative of atopy within the past 12 months. Data from an abbreviated 
questionnaire similar to that administered to the affected sib pair families were 
collected. Results from skin prick tests to 4 common allergens were also 
collected. The results of the skin prick test were used to select a subset of 

20 controls that were most likely to be asthma and atopy negative. 

A subset of unrelated cases was selected from the affected sib pair 
families based on the evidence for linkage at the chromosomal location near 
a given gene. One affected sib demonstrating identity-by-descent (IBD) at the 
appropriate marker loci was selected from each family. Since the appropriate 

25 cases may vary for each gene in the chromosome 20 region, a larger collection 
of individuals who were IBD across a larger interval were genotyped, and a 
subset was used in the analyses. On average, 130 IBD affected individuals 
and 200 controls were compared for allele and genotype frequencies. This 
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number provided an 80% power to detect a difference of 5% or greater 
between the two groups for a rare allele (< 5%) at a 0.05 level of significance. 
For a common allele (50%), the number provided an 80% power to detect a 
difference of 10% or more between the two groups. 
5 For each polymorphism, the frequency of the alleles in the control and 

case populations was compared using a Fisher exact test. A mutation that 
increased susceptibility to the disease would be more prevalent in the cases 
than in the controls, while a protective mutation would be more prevalent in the 
control group. Similarly, the genotype frequencies of the SNPs were compared 
10 between cases and controls. P-values for both the allele and genotype were 
plotted against a coordinate system based on genomic sequence to visualize 
regions where allelic association was present. A small p-value (or a large 
value of -log (p) as plotted in the figures described below) was indicative of an 
association between the SNPs and the disease phenotype. The analysis was 
1 5 repeated for the US and UK population separately to adjust for the possibility 
of genetic heterogeneity. 

2. Association test with individual SNPs : Chromosomal regions 
harboring asthma susceptibility genes were identified by association studies 
using the SNP typing data. Two separate phenotypes were used in these 
20 analyses: asthma and bronchial hyper-responsiveness. 

a. Asthma Phenotype : The significance levels (p-values) for 
allelic association of all typed SNPs in Gene 21 6 to the asthma phenotype are 
plotted in Figure 25 (combined population) and Figure 26 (US and UK 
populations separately). The most significant result in the combined population 
25 was observed for Gene 21 6 exon T+1 , where 92.4% of the cases harbored the 
intronic mutation, while the SNP was present in only 85.2% of the controls (p 
= 0.0055). Six additional SNPs in Gene 216 (T5, QR+7, QR+4, Q2, Q1, and 
U-1) were significant at the 0.05 level. Frequencies and p-values for SNPs 
associated with the asthma phenotype in Gene 216 are presented in Tables 
30 12, 13, and 14 for the combined population and for the UK and US 
populations, separately. 
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TABLE 12 



Asthma Yes/NO 


Combined 
US and UK 


GENE_EXON 


Frequencies 
CNTL 


N 


CASE 


N 


ALLELE 
P-VALUE 


GENOTYPE 
P-VALUE 


gene216_T_2 


66.5% 


215 


71.5% 


128 


0.2029 


0.1482 


gene216JT_3 


8.7% 


213 


9.5% 


131 


0.7841 


0.6895 


gene216_T_4 


96.3% 


215 


98.5% 


129 


0.1576 


0.1513 


gene216_T_5 


76.7% 


217 


83.3% 


129 


0.0420 


0.0468 


gene216_T_6 


77.8% 


214 


78.4% 


125 


0.9235 


0.9791 ' 


gene216_T_7 


96.3% 


215 


98.5% 


129 


0.1576 


0.1513 


gene216_T_8 


96.5% 


211 


98.1% 


129 


0.2528 


0.2456 


gene216JL_+1 


85.2% 


216 


92.4% 


131 


0.0055 


0.0178 


gene2l6_T_+2 


37.3% 


209 


39.0% 


127 


0.6825 


0.7722 


gene21 6_T_+4 


24.4% 


215 


26.3% 


131 


0.5886 


0.7410 


gene216_R_+2 


88.3% 


217 


88.9% 


131 


0.8076 


0.9005 


gene216_R_+1 


88.7% 


191 


88.8% 


120 


1.0000 


0.8394 


gene216_R_2 


9.4% 


208 


10.8% 


125 


0.5928 


0.7656 


gene216_R_1 


11.3% 


217 


11.8% 


131 


0.9025 


0.7483 


gene216_QR_+7 


78.1% 


215 


85.7% 


129 


0.0160 


0.0265 


gene216_QR_+6 


0.5% 


216 


0.8% 


129 


0.6323 


0.6317 


gene216_QR_+5 


46.4% 


210 


48.8% 


129 


0.5794 


0.4165 


gene216_QR_+4 


51.5% 


205 


59.9% 


126 


0.0367 


0.1272 


gene216_Q_+1 


51.2% 


206 


52.5% 


120 


0.8075 


0.6608 


gene2l6_Q_2 


73.7% 


217 


80.5% 


131 


0.0432 


0.0831 


gene216_Q_1 


89.5% 


209 


94.8% 


125 


0.0213 


0.0584 


gene216_U_-1 


85.0% 


217 


91 .2% 


131 


0.0184 


0.0659 


gene216_L_+1 


88.7% 


213 


88.9% 


131 


1.0000 


0.9672 


gene216_L_1 


99.3% 


217 


99.6% 


131 


1.0000 


1.0000 


gone21 6_L_-1 


88.9% 


212 


89.2% 


130 


1 .0000 


1 .0000 


gene216_L_-2 


92.9% 


212 


93.1% 


131 


1.0000 


0.9379 


gene2l 6_V__+2 


71.3% 


216 


77.1% 


129 


0.1085 


0.2262 


gene21 6_V_+1 


96.1% 


217 


97.2% 


125 


0.5223 


0.5145 


gene216_l_1 


84.9% 


212 


85.3% 


129 


0.9124 


1.0000 


gene216_G_-1 


90.7% 


210 


91.3% 


127 


0.8900 


0.7683 


gene216_F_+1 


65.2% 


197 


70.4% 


120 


0.1913 


0.4109 


gene216_F_1 


96.8% 


217 


96.9% 


129 


1 .0000 


1.0000 


gene216_D_1 


0.0% 


215 


0.4% 


131 


0.3786 


0.3786 


gene216_D_-2 


0.7% 


214 


0.8% 


127 


1.0000 


1.0000 
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TABLE 13 



Asthma Yes/No 


UK population 




Frequencie 
s 

CNTL 








ALLELE 


GENOTYPE 




N 


CASE 


N 


P-VALUE 


P-VALUE 


npnp91R T 9 


65.8% 


1 39 


74.3% 


101 


0.0566 


0.1266 


ytsi itJt i v? i __o 


8 3% 


139 


9 6% 


104 


n R*i08 

U.UOvU 


0.7329 


ytsi let. i vj i *r 


97.1% 


140 


98.5% 


103 


n SRRQ 


0 3633 


nf»n»9"1R T R 
y tsi \\s£. i u i >j 


75.4% 


140 


83.3% 


102 


0.0426 


U.UJDJ 


npnp9"lR T R 


78.5% 


137 


80.1% 


98 


0.7301 


0.8875 


npnp91R T 7 


97.5% 


138 


99.0% 


102 


0.3129 


0.3082 


gene216 T 8 


97.8% 


137 


98.5% 


102 


0.7388 


0.7363 


gene216 T +1 


86.4% 


140 


93.8% 


104 


0.0105 


0.0243 


gene216 T +2 


37.9% 


136 


40.5% 


100 


0.5682 


0.8375 


npnp91R T 
ycijc^ i L/ f ■ *r 


25.2% 


139 


26.0% 


104 


0.91 63 


0.6037 


gene216 R +2 


87.5% 


140 


87.5% 


104 


1 .0000 


1 .0000 


npnp91R R +*1 
yen kzi-. i r\ ~ i 


86.9% 


1 22 


91.1% 


95 


0.2211 


0.4281 


npnp91R R 9 


10.5% 


134 


8.2% 


98 


0.4279 


0.7007 


npnp91fi R 1 


13.2% 


140 


8.7% 


104 


0.1473 


0.3472 


yci ic^. i vj VjtPt ~ / 


79.5% 


139 


86.4% 


103 




0.1362 


npnp?1R OR +R 


0.0% 


139 


1 .0% 


103 


0.1806 


0.1801 


npnp9*lR OR +R 


44.4% 


133 


50.0% 


102 


0.2273 


0.2470 


npnp91R OR -t-d 


48.1% 


128 


59.1% 


99 


0.0229 


0.0730 


npnp?1R O +1 


53.1% 


129 


50.5% 


97 


0.6346 


0.5458 


gene216 Q 2 


72.9% 


140 


84.6% 


104 


0.0020 


0.0050 


gene216_Q_1 


89.4% 


132 


95.1% 


101 


0.0274 


0.0732 


gene21 6_U_-1 


86.1% 


140 


92.3% 


104 


0.0419 


0.0763 


gene216_L_+1 


87.0% 


138 


91 .8% 


104 


0.1059 


0.2969 


gene216_L_1 


99.3% 


140 


99.5% 


104 


1.0000 


1 .0000 


gene216_L_-1 


87.2% 


137 


92.2% 


103 


0.0992 


0.1655 


gene216J__-2 


92.7% 


137 


92.3% 


104 


0,8633 


1 .0000 


gene216_V_+2 


71.6% 


139 


79.1% 


103 


0.0717 


0.1519 


gene216_V_+1 


97.1% 


140 


98.0% 


99 


0.7685 


0.7655 


gene216J_1 


83.7% 


138 


89.2% 


102 


0.1094 


0.1323 


gene216__G_-1 


90.2% 


137 


90.1% 


101 


1 .0000 


0.4913 


gene216_F_+1 


64.1% 


128 


74.2% 


93 


0.0295 


0.071 1 


gene216_F„1 


97.9% 


140 


98.0% 


102 


1.0000 


1.0000 


gene21 6_DJ 


0.0% 


139 


0.5% 


104 


0.4280 


0.4280 


gene216_D_-2 


0.7% 


139 


1.0% 


101 


1.0000 


1.0000 
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TABLE 14 



Asthma Yes/No 


US population 




Froquencies 








ALLELE 


GENOTYPE 


GENE_EXON 


CNTL 


N 


CASE 


N 


P-VALUE 


P-VALUE 


gene216_T_2 


67.8% 


76 


61.1% 


27 


0.4053 


0.1776 


gene216_T_3 


9.5% 


74 


9.3% 


27 


1 .0000 


1.0000 


gene216_T_4 


94.7% 


75 


98.1% 


26 


0.4519 


0.4404 


gene216_TJ5 


79.2% 


77 


83.3% 


27 


0.5583 


0.7765 


gene216_T_6 


76.6% 


77 


72.2% 


27 


0.5819 


0.6932 


gene216_T_7 


94.2% 


77 


96.3% 


27 


0.7320 


0.7241 


gene21 6_T_8 


93.9% 


74 


96.3% 


27 


0.7308 


0.7226 


gene216_T__+1 


82.9% 


76 


87.0% 


27 


0.5262 


0.8281 


gene216_T_+2 


36.3% 


73 


33.3% 


27 


0.7416 


0.5739 


gene216_T_+4 


23.0% 


76 


27.8% 


27 


0.5795 


0.6743 


gene216„R_+2 


89.6% 


77 


94.4% 


27 


0.4127 


0.3874 


gene216„R_+1 


92.0% 


69 


80.0% 


25 


0.0334 


0.0361 


gene216_R_2 


7.4% 


74 


20.4% 


27 


0.0188 


0.0208 


gene216_R_1 


7.8% 


77 


24.1% 


27 


0.0030 


0.0055 


gene21 6__QR_+7 


75.7% 


76 


82.7% 


26 


0.3410 


0.0921 


gene216J3R_+6 


1 .3% 


77 


0.0% 


26 


1 .0000 


1.0000 


gene21 6_QR_+5 


50.0% 


77 


44.4% 


27 


0.5287 


0.6337 


gene21 6_QR_+4 


57.1% 


77 


63.0% 


27 


0.5218 


0.4709 


gene216_Q_+1 


48.1% 


77 


60.9% 


23 


0.1345 


0.3169 


gene216_Q_2 


75.3% 


77 


64.8% 


27 


0.1571 


0.1404 


gene216_Q_J 


89.6% 


77 


93.8% 


24 


0.5726 


1.0000 


gene216JJ_-1 


83.1% 


77 


87.0% 


27 


0.6654 


0.8280 


gene216_L_+1 


92.0% 


75 


77.8% 


27 


0.0116 


0.0123 


gene216J__1 


99.4% 


77 


100.0% 


27 


1.0000 


1.0000 


gene216__L_-1 


92.0% 


75 


77.8% 


27 


0.0116 


0.0123 


gene216_L_-2 


93.3% 


75 


96.3% 


27 


0.7362 


0.5089 


gene216_V_+2 


70.8% 


77 


69.2% 


26 


0.8614 


0.8889 


gene216_V_+1 


94.2% 


77 


94.2% 


26 


1.0000 


1 .0000 


gene21 6J_1 


87.2% 


74 


70.4% 


27 


0.0105 


0.0074 


gene216J3_-1 


91.8% 


73 


96.2% 


26 


0.3635 


0.3440 


gene216JF_+1 


67.4% 


69 


57.4% 


27 


0.2401 


0.3270 


gene216_F_1 


94.8% 


77 


92.6% 


27 


0.5136 


0.5043 


gene216__D_1 


0.0% 


76 


0.0% 


27 


1.0000 


1.0000 


gene216J3_-2 


0.7% 


75 


0.0% 


26 


1.0000 


1.0000 
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b. Bronchial Hyper-responsiveness : The analyses were 
repeated using asthmatic children with borderline to severe BHR (PC 20 < 16 
mg/ml) or PC2o(16), as described in the linkage section. First, sibling pairs 
were identified where both sibs were affected and satisfied this new criteria. 
5 Of these pairs, one sib was included in the case/control analyses if they 
showed evidence of linkage at the gene of interest. This phenotype was more 
restrictive than the Asthma yes/no criteria; hence the number of cases included 
in the analyses was reduced approximately in half. If the PC20O6) subgroup 
represented a more genetically homogeneous sample, one expected to see an 

10 increase in the effect size compared to the one observed in the original set of 
cases. However, the reduction in sample size could result in estimates that 
were less accurate and that could obscure a trend in allele frequencies in the 
control group, the original set of cases and the PC 2 o(16) subgroup. In addition, 
the reduction in sample size could induce a reduction in power (and increase 

15 in p values) in spite of the larger effect size. 

The significance levels (p-values) for allelic association of all typed 
SNPs in Gene 216 to the BHR phenotype are plotted in Figure 27 (combined 
population) and Figure 28 (US and UK populations separately). Frequencies 
and p-values for SNPs associated with the BHR phenotype in Gene 216 are 

20 presented in Tables 1 5, 16, and 17 for the combined population and for the UK 
and US populations, separately. Again, multiple SNPs in Gene 216 were 
associated with the phenotype in each separate population. In the UK 
population, the most significant SNP was in Gene 216, exon Q2, where 87% 
of the cases had the mutation compared to 72.9% for the controls (p = 0.0038). 

25 For the US population, the most significant association was found with the 
SNP in Gene 216 exon R 1, where 28.6% of the cases carried the mutation 
compared to 7.8% for the controls (p = 0.0041). 

In summary, Gene 216 associated with the phenotypes of both asthma 
and bronchial hyper-responsiveness. Association was found with multiple 

30 SNPs in both the UK and US populations. The 3' region of the gene, which 
contains the transmembrane domain, the cytoplasmic domain, and the 3* UTR, 
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appeared to have the strongest association. Taken together, these data 
strongly suggested that Gene 216 is an asthma susceptibility gene. 
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TABLE 15 



BHR 


Combined 
US and UK 


GENE__EXON 


Frequencies 
CNTL 


N 


CASE 


N 


ALLELE 
P-VALUE 


GENOTYPE 

P-VALUE 


gene21 6_T_2 


66.5% 


215 


67.7% 


62 


0.8294 


0.1358 


gene216_T_3 


8.7% 


213 


9.4% 


64 


0.8592 




gene216_T_4 


96.3% 


215 


98.4% 


62 


0.3878 


0.3797 


gene216_T_5 


76.7% 


217 


79.8% 


62 


0.5428 


0.5315 


gene2l6_T_6 


77.8% 


214 


78.3% 


60 


1 .0000 


0.8426 


gene216_T_7 


96.3% 


215 


97.7% 


64 


0.5856 


0.5786 


gene216_T_8 


96.5% 


211 


97.6% 


63 


0.7758 


0.7721 


gene216_T_+1 


85.2% 


216 


90.6% 


64 


0.1413 


0.3117 


gene216_T_+2 


37.3% 


209 


41.8% 


61 


0.3978 


0.6939 


gene2l6_T_+4 


24.4% 


215 


26.6% 


64 


0.6421 


0.2498 


gene216_R_+2 


88.3% 


217 


88.3% 


64 


1 .0000 


0.8975 


gene216_R_+1 


88.7% 


191 


89.2% 


60 


1.0000 


0.7540 


gene21 6_R_2 


90.6% 


208 


91.1% 


62 


1 .0000 


1 .0000 


gene216_R_1 


1 1 .3% 


217 


1 1 .7% 


64 


0.8750 


0.7576 


gene216_QR_+7 


78.1% 


215 


82.0% 


64 ~~ ] 


0.3876 


0.171 1 


gene216_QR_+6 


99.5% 


216 


1 00.0% 


63 


1 .0000 


1 .0000 


gene216„QR_+5 


46.4% 


210 


46.8% 


63 


1 .0000 


0.5530 


gene216_QR_+4 


51.5% 


205 


58.9% 


62 


0.1521 


0.3393 


gene21 6_Q_+1 


51 .2% 


206 


51.8% 


57 


i ,UUUU 




gene216_Q_2 


73.7% 


217 


79.7% 


64 


0.2009 


0.0664 


gene216_Q_1 


89.5% 


209 


94.2% 


60 


0.1565 


0.4299 


gene216JJ_-1 


85.0% 


217 


89.8% 


64 


0.1915 


0.5304 


gene216_L_+1 


88.7% 


213 


89.8% 


64 


0.8722 


0.9410 


gene216__L_1 


0.7% 


217 


0.8% 


64 


1 .0000 


1.0000 


gene216j__-1 


88.9% 


212 


89.1% 


64 


1.0000 


1.0000 


gene216_L__-2 


7.1% 


212 


8.6% 


64 


0.5661 


0.5313 


gene216__V_+2 


71.3% 


216 


75.0% 


64 


0.4343 


0.7291 


gene216_V_+1 


96.1% 


217 


97.6% 


63 


0.5874 


0.5802 


gene216_M 


84.9% 


212 


86.7% 


64 


0.6709 


0.8958 


gene216_G_-1 


9.3% 


210 


9.5% 


63 


1.0000 


0.9355 


gene216_F_+1 


65.2% 


197 


66.7% 


57 


0.8234 


0.3665 


gene216_F_1 


96.8% 


217 


97.6% 


62 


0.7752 


0.7715 


gene216_D_1 


0.0% 


215 


0.8% 


64 


0.2294 


0.2294 


gene216_D_-2 


0.7% 


214 


0.8% 


63 


1.0000 


1.0000 
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TABLE 16 



BHR 


UK population 


GENE_EXON 


Frequencies 
CNTL 


N 


CASE 


N 


ALLELE 
P-VALUE 


GENOTYPE 
P-VALUE 


gene216_T_2 


65.8% 


139 


74.0% 


48 


0.1635 


0.1885 


gene216_T_3 


8.3% 


139 


9.0% 


50 


0.8352 


0.6515 


gene216_T_4 


97.1% 


140 


98.0% 


49 


1 .0000 


1.0000 


gene216_T_5 


75.4% 


140 


81.3% 


48 


0.2641 


0.3646 


gene216_T_6 


78.5% 


137 


79.4% 


46 


1.0000 


0.9547 


gene216_T_7 


97.5% 


138 


98.0% 


50 


1.0000 


1.0000 


gene216_T_8 


97.8% 


137 


98.0% 


49 


1.0000 


1.0000 


gene216_T_+1 


86.4% 


140 


94.0% 


50 


0.0454 


0.1307 


gene21 6_T_+2 


37.9% 


136 


44.7% 


47 


0.2715 


0.4549 


gene216_T_+4 


25.2% 


139 


26.0% 


50 


0.8938 


0.1153 


gene216_R_+2 


87.5% 


140 


86.0% 


50 


0.7290 


0.6834 1 


gene216_R_+1 


86.9% 


122 


92.6% 


47 


0.1838 


0.3875 


gene216_R_2 


89.6% 


134 


94.8% 


48 


0.1494 


0.4752 


gene21 6_R_1 


13.2% 


140 


7.0% 


50 


0.1041 


0.3226 


gene21 6_QR_+7 


79.5% 


139 


85.0% 


50 


0.2983 


0.3872 


gene216_QR_+6 


0.0% 


139 


0.0% 


49 


1 .0000 


1 .0000 


gene21 6_QR_+5 


44.4% 


133 


49.0% 


49 


0.4771 


0.5020 


gene216_QR_+4 


48.1% 


128 


57.3% 


48 


0.1508 


0.2350 


gene216_Q_+1 


53.1% 


129 


48.9% 


45 


0.5407 


0.6988 


gene216_Q_2 


72.9% 


140 


87.0% 


50 


0.0038 


0.0128 


gene216_Q_1 


89.4% 


132 


95.8% 


48 


0.0613 


0.1924 


gene216JJ_-1 


86.1% 


140 


93.0% 


50 


0.0752 


0.2087 


gene21 6__L_+1 


87.0% 


138 


94.0% 


50 


0.0638 


0.2367 


gene216_L_1 


0.7% 


140 


1.0% 


50 


1.0000 


1.0000 


gene216_L_-1 


87.2% 


137 


93.0% 


50 


0.1400 


0.3796 


gene216_L_-2 


7.3% 


137 


9.0% 


50 


0.6623 


0.5686 


gene216_V_+2 


71.6% 


139 


79.0% 


50 


0.1860 


0.3615 


gene216_V_+1 


97.1% 


140 


98.0% 


49 


1.0000 


1.0000 


gene216J_1 


83.7% 


138 


91.0% 


50 


0.0952 


0.2406 


gene216_G_-1 


9.9% 


137 


10.2% 


49 


1.0000 


0.9269 


gene216_F_+1 


64.1% 


128 


73.3% 


43 


0.1466 


0.2885 


gene216_F_1 


97.9% 


140 


97.9% 


48 


1.0000 


1 .0000 


gene216_D_1 


0.0% 


139 


1 .0% 


50 


0.2646 


0.2646 


gene216_D_-2 


0.7% 


139 


1.0% 


49 


1.0000 


1.0000 



-165- 



WO 01/78894 



PCT/US01/12245 



TABLE 17 



BHR 


US population 


GENE EXON 


Freguencios 
CNTL 


N 


CASE 


N 


ALLELE 
P-VALUE 


GENOTYPE 
P-VALUE 


gene216 T 2 


67.8% 


76 


46.4% 


14 


0.0514 


0.0409 


gene216 T 3 


9.5% 


74 


10.7% 


14 


0.7369 


1 .0000 


gene216 T 4 


94.7% 


75 


100.0% 


13 


0.6065 


0.5986 


gene216 T 5 


79.2% 


77 


75.0% 


14 


0.6206 


0.6767 


gene216 T 6 


76.6% 


77 


75.0% 


14 


0.8130 


0.7738 


gene216 T 7 


94.2% 


77 


96.4% 


14 


1.0000 


1.0000 


gene216 T 8 


93.9% 


74 


96.4% 


14 


1.0000 


1.0000 


gene216 T +1 


82.9% 


76 


78.6% 


14 


0.5937 


0.6635 


gene216 T +2 


36.3% 


73 


32.1 % 


14 


0.8300 


1.0000 


gene216 T +4 


23.0% 


76 


28.6% 


14 


0.6296 


0.7242 


gene216 R +2 


89.6% 


77 


96.4% 


14 


0.4778 


0.4545 


gene216 R +1 


92.0% 


69 


76.9% 


13 


0.0321 


0.0452 


gene216 R 2 


92.6% 


74 


78.6% 


14 


0.0333 


0.0469 


gene216 R 1 


7.8% 


77 


28.6% 


14 


0.0041 


0.0072 


gene216 QR +7 


75.7% 


76 


71 .4% 


14 


0.6391 


0.2476 


gene2l6 QR +6 


98.7% 


77 


100.0% 


14 


1 .0000 


1.0000 


gene216 QR +5 


50.0% 


77 


39.3% 


14 


0.3130 


0.4007 


gene216 QR +4 


57.1% 


77 


64.3% 


14 


0.5371 


0.8691 


geno216 Q +1 


48.1% 


77 - 


62.5% 


12 


0.2724 


0.4060 


gene216_Q_2 


75.3% 


77 


53.6% 


14 


0.0233 


0.0331 


gene216_Q_1 


89.6% 


77 


87.5% 


12 


0.7250 


0.5718 


gene216_U_-1 


83.1% 


77 


78.6% 


14 


0.5910 


0.6593 


gene216_L_+1 


92.0% 


75 


75.0% 


14 


0.0149 


0.0227 


gene216J__1 


0.6% 


77 


0.0% 


14 


1 .0000 


1.0000 


gene216_L_-1 


92.0% 


75 


75.0% 


14 


0.0149 


0.0227 


gene216_L_-2 


6.7% 


75 


7.1% 


14 


1 .0000 


1.0000 


gene216_V_+2 


70.8% 


77 


60.7% 


14 


0.3730 


0.2711 


gene216_V_+1 


94.2% 


77 


96.4% 


14 


1.0000 


1.0000 


gene21 6_M 


87.2% 


74 


71.4% 


14 


0.0455 


0.0463 


gene21 6_G_-1 


8.2% 


73 


7.1% 


14 


1.0000 


1.0000 


gene216J=_+1 


67.4% 


69' 


46.4% 


14 


0.0510 


0.0665 


gene21 6_F_1 


94.8% 


77 


96.4% 


14 


1 .0000 


1 .0000 


gene21 6_D_1 


0.0% 


76 


0.0% 


14 


1.0000 


1.0000 


gene216_D_-2 


0.7% 


75 


0.0% 


14 


1.0000 


1.0000 
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EXAMPLE 13: Haplotype analyses 

In addition to the analysis of individual SNPs, haplotype frequencies 
between the case and control groups were also compared. The haplotypes 
were constructed using a maximum likelihood approach. Since existing 
5 software for predicting haplotypes is unable to utilize individuals with missing 
data, a program was developed to make use of all individuals and, hence, 
provide more accurate haplotype frequency estimates. Haplotype analysis 
based on multiple SNPs in a gene is expected to provide increased evidence 
for an association between a given phenotype and that gene if all haplotyped 

10 SNPs are involved in the characterization of the phenotype. In other words, 
allelic variation involving those haplotyped SNPs are expected to be associated 
with different risks or susceptibilities toward the phenotype. 

1 . Asthma phenotype : The estimated frequency of each haplotype 
was compared between cases and controls by a permutation test. An overall 

1 5 comparison of the distribution of all haplotypes between the two groups was 
also performed. In Tables 18, 19 and 20 the haplotype analysis (2-at-a-time) 
for all SNPs in Gene 216 is presented for the combined, the UK and the US 
populations, respectively. The diagonal entries represent the single SNP p- 
values, while the other entries are the p-values for a test of association 

20 between the asthma phenotype and the haplotypes defined by the 2 SNPs 
listed on the horizontal and vertical axes. The frequency of the individual SNPs 
in the cases and controls are shown at the bottom of the tables. Colored cells 
indicate p-values that were statistically significant (light gray: 0.01 to 0.05, dark 
gray: 0.001 to 0.0099, black: < 0.001). As seen in Table 18, haplotypes 

25 defined by SNPs T5 & T8, SNPs T+2 & QR+4, T5 & 17 and SNPs T4 & T5, 
yielded highly significant p-values of 0.00039, 0.000042, 0.00056 and 0.00042 
respectively, which were more significant than the analysis of these SNPs 
alone (T4 p = 0.16; T5 p = 0.04; T7 p = 0.16; T8 p = 0.25; T+2 p = 0.68; QR+4 
p = 0.04). These associations were also more significant than the one 

30 observed for the single SNP T+1 reported above. In the UK population, the 
most significant association was found in Gene 216 (Table 19) with five 
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haplotypes significant at the 0.001 level (SNPs T+2 & QR+4, p = 0.000021; 
QR+5 & QR+4, p = 0.00051; QR+4 & Q+1 p = 0.00066; QR+6 & Q2, p = 
0.00062; and QR+4 & Q2, p = 0.00023). Forty four haplotypes were significant 
at the 0.01 level in Gene 216 (Table 19) in the UK population. In the US 
5 population, numerous haplotypes were significant at the 0.01 level for Genes 
216 (Table 20). 
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2. Bronchial Hyper-responsiveness : A similar test for association of 2- 
SNP-a-time haplotypes with BHR (PC 2 o < 16 mg/ml) was performed. In Tables 
21, 22 and 23, the haplotype analysis (2-at-a-time) for all SNPs in Gene 216 
5 is presented for the combined, the UK and the US populations, respectively. 
One haplotype in Gene 216 (Table 21: SNPs T+2 & QR+4, p=0.0041) was 
significant at the 0.01 level in the combined sample. !n contrast, in the UK 
population, seventeen haplotypes were significant at the 0.01 level in Gene 
216 (Table 22). In the US population, nine haplotypes were significant at the 
10 0.01 level in Gene 21 6 (Table 23). Tables 18,19, and 20 and Tables 21 , 22 
and 23 showed similar patterns of significance with lower level achieved in the 
BHR analysis due to the reduced sample size in the (PC 2 o ^ 16 mg/ml) 
subgroup. 

In summary, haplotype analysis of SNPs significantly strengthened the 
15 evidence in support of Gene 216 as an asthma susceptibility gene. In some 
SNP combinations, the association was increased by an order of magnitude. 
The most striking association again appeared in the 3 ' region of the gene, in 
agreement with the single SNP analysis. 
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EXAMPLE 14: Transmission Disequilibrium Test (TDT) 

To ensure that the significant association observed in the case-control 
studies was not an artifact due to population admixture, a family based test of 
association, the transmission disequilibrium test (TDT) was conducted. By 
5 selecting a single affected offspring in each family, the TDT test performed a 
test of association (due to linkage disequilibrium) in the presence of linkage. 
The test determined whether a particular allele was preferentially transmitted 
to an affected individual over what would be expected by chance. Only 
heterozygous parents were considered informative for the TDT. In addition, to 

10 increase power, heterozygous parents transmitting a different allele to two 
affected offspring were ignored. Accordingly, the TDT would be based on the 
same families that contributed to the linkage signal. The significance levels 
were estimated by Markov Chain Monte Carlo simulation methods as 
implemented in TDT EX from the S.A.G.E. program (Department of 

15 Epidemiology and Biostatistics, Rammelkamp Center for Education and 
Research, MetroHealth Campus, Case Western Reserve University, Cleveland, 
OH (1997)). 

1. Asthma Phenotype : Five candidate SNPs were typed in the 
extended population in order to confirm the association seen in the case- 

20 control study. The five SNPs were in Gene 216 exons T5, T8, T+1 , R1 , and 
Q1. Since only heterozygote parents contribute information to the TDT test, 
SNP haplotypes (all 2-at-a-tirne and all 3-at-a-time) were constructed based on 
family data with the program GENEHUNTER (Kruglyak et al., 1996) in addition 
to analyzing the SNPs separately. This served to increase the informativeness 

25 of the single SNPs. These haplotypes were then used as "alleles" in future 
TDT analyses. In addition, p-values obtained from the TDT analyses were 
compared to the p-values obtained from the haplotyping in the case/control 
setting. To check for consistency, the p-values were recorded to compare the 
haplotype frequencies between the cases and controls of the over-transmitted 

30 alleles/haplotypes. 

The TDT results strongly supported the association previously observed 
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in the case control studies (Table 24). Three of the five SNPs showed alleles 
that were preferentially transmitted to affected offspring (p < 0.04 to < 0.0044) 
in either the combined or UK population. When these SNPs were haplotyped 
together, most combinations had a hapiotype that was preferentially 
5 transmitted to affected offspring (p < 0.03 to < 0.001). The most significant 
hapiotype in the combined population was composed of SNPs T+1/R1/Q1 (p 
= 0.0006). The most significant hapiotype in the UK population was composed 
of SNPs T5/R1/Q1 (p = 0.0005). In contrast to the UK population, none of the 
single SNP allele or multiple SNP haplotypes were preferentially over- 

10 transmitted to affected offspring at significant levels in the US population. This 
is most likely due to the combination of reduced power of the TDT versus the 
case-control study and the smaller sample size in the US. 

Importantly, for all of the single SNP or multiple SNP haplotypes the 
allele that was significantly over-transmitted in either the combined population 

15 or in the UK sample was more frequent in the cases than in the controls. A 
summary of the TDT analyses and a comparison between the Case/control 
and TDT results are presented in Table 24. 

2. Bronchial Hyper-responsiveness : The TDT analyses were 
repeated using only those asthmatic pairs that satisfied the additional criteria 

20 of having a Pd 20 ^ 16 mg/ml (Table 25). The vast majority of single SNP and 
multiple SNP haplotypes showed increased significance with the more 
restricted phenotype. P values reached levels of < 0.00008 for T5/R1/Q1 in 
the combined population and p < 0.000008 in the UK sample. Similar to the 
yes/no phenotype, for the majority of the alleles in both the combined and UK 

25 population, the over-transmitted alleles in the TDT were more frequent in the 
cases. Similar to the yes/no phenotype with the lelss powerful TDT test, no 
significant results were observed with smaller US sample. In summary, the 
analysis of single SNPs and SNP haplotypes by the TDT test provided 
confirmatory evidence for Gene 216 as an asthma susceptibility gene. 
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TABLE 24 



Asthma Yes/NO 


Combined 
US and UK 


Over-Tra nsm itted 
Haplotype 


Exon in 
Gene 216 


TDT p-value 


Case/Control 
p-value 


Control Frequency 


Case Frequency 


Q 1 


0.0337 


0.0213 


89.5% 


94.8% 


R 1 


0.0725 


NS 


88.7% 


88.2% 


T +1 


0.0956 


0.0055 


85.2% 


92.4% 


T 8 


1 .0000 


NS 


NA 




T 5 


0.1364 


0.0420 


76.7% 


83.3% 


R1Q1 


0.0042 


0.1362 


78.2% 


83.1% 


T+1Q1 


0.0932 


0.0049 


85.2% 


92.4% 


T8Q1 


0.0553 


0.0084 


86.0% 


92.9% 


T5Q1 


0.2659 


0.0342 


76.2% 


83.0% 


T+1R1 


0.0029 


0.0465 


73.9% 


80.6% 


T8R1 


0.0799 


NS 


85.1% 


67.9% 


T5R1 


0.0107 


0.1537 


66.1% 


71.5% 


T8T+1 


0.2762 


0.0044 


85.2% 


92.4% 


T5T+1 


0.3078 


0.0012 


72.5% 


83.0% 


T5T8 


0.0948 


0.0028 


73.7% 


83.4% 


T+1R1Q1 


0.0006 


0.0430 


73.9% 


80.8% 


T8R1Q1 


0.0086 


0.0552 


74.7% 


81.2% 


T5R1Q1 


0.0025 


0.1591 


65.9% 


71.2% 


T5T+1R1 


0.0136 


0.0175 


62.3% 


71.2% 


T8T+1R1 


0.0084 


0.0377 


73.9% 


80.9% 


T5T8R1 


0.0060 


0.0235 


63.0% 


71.5% 


T5T8Q1 


0.1242 


0.0033 


73.1% 


83.0% 


T5T8T+1 


0.1540 


0.0009 


72.7% 


83.0% 


T8T+1Q1 


0.1351 


0.0043 


85.3% 


92.4% 


T5T+1Q1 


0.1080 


0.0010 


72.5% 


83.0% 



NS = non-significant or over-transmitted allele not present more often in cases than controls 
NA = no alleles were over-transmitted 
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TABLE 24 (CONT) 



Asthma Yes/NO 


UK 


Over-Transmitted 
Haplotvpe 


Exon in ! 
Gene 216 


TDT p-value 


Case/Controi 
p-value 


Control Frequency 


Case Frequency 


Q 1 


0.0044 


0.0274 


89.4% 


95.1% 


R 1 


0.3665 


0.1473 


86.8% 


91.4% 


T +1 


0.0128 


0.0105 


86.4% 


93.8% 


T 8 


1 .0000 


NS 


NA 




T 5 


0.0434 


0.0426 


75.4% 


83.3% 


R1Q1 


0.0044 


0.0069 


76.2% 


86.5% 


T+1Q1 


0.0714 


0.0066 


86.4% 


93.8% 


T8Q1 


0.0342 


0.0275 


87.4% 


93.6% 


T5Q1 


0.1687 


0.0314 


74.9% 


82.9% 


T+1R1 


0.0269 


0.0018 


73.2% 


85.1 % 


T8R1 


0.4848 


0.0933 


84.6% 


89.9% 


T5R1 


0.0639 


0.0067 


63.1% 


74.7% 


T8T+1 


0.2254 


0.0069 


86.4% 


93.8% 


T5T+1 


0.2007 


0.0088 


72.9% 


82.9% 


T5T8 


0.0277 


0.0103 


73.7% 


83.4% 


T+1R1Q1 


0.0063 


0.0016 


73.2% 


85.1% 


T8R1Q1 


0.0139 


0.0039 


74.1% 


85.0% 


T5R1Q1 


0.0005 


0.0136 


63.4% 


74.2% 


T5T+1 R1 


0.0220 


0.0036 


61.5% 


74.2% , 


T8T+1R1 


0.0043 


0.0012 


73.2% 


85.1% 


T5T8R1 


0.0095 


0.0018 


61 .5% 


74.7% 


T5T8Q1 


0.0074 


0.0105 


73.3% 


82.9% 


T5T8T+1 


0.0255 


0.0082 


73.0% 


82.9% 


T8T+1Q1 


0.0207 


0.0087 


86.4% 


93.8% 


T5T+1Q1 


0.0127 


0.0093 


72.9% 


82.9% 



NS = non-significant or over-transmitted allele not present more often in cases than controls; 
NA = no alleles were over-transmitted 
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TABLE 24 (CONT) 



Asthma Yes/NO 


US 


Over-Transmitted 
Haplotyne 


Exon in 
Gene 216 


TDT p-value 


Case/Control p- 
value 


Control Frequency 
_ 


Case Frequency 


Q 1 


0.8039 


NS 


10.4% 


6.3% 


R 1 


0.1067 


NS 


92.2% 


75.9% 


T +1 


0.6288 


NS 


17.1% 


13.0% 


T 8 


1 .0000 


NS 


NA 




T 5 


0.7020 


NS 


20.8% 


lO./ /o 


R1Q1 


0.2134 


NS 


81.8% 


by.b /o 


T+1Q1 


0.6811 ! 


NS 


1 0.4% 


y./ /o 


T8Q1 


0.7584 


0.2887 


83.6% 


90.2% 


T5Q1 




MQ 
iNo 


Q 7°/, 


8.3% 


T+1R1 


0.0658 


NS 


75.1% 


63.0% 


T8R1 


0.0687 


NS 


86.1% 


72.2% 


T5R1 


0.1859 


NS 


71.4% 


59.3% 


T8T+1 


0.9465 


0.4778 


83.0% 


87.0% 


T5T+1 


0.8537 


0.5074 


9.7% 


13.0% 


T5T8 


0.8848 


NS 


20.8% 


13.0% 


T+1R1Q1 


0.1569 


NS 


75.2% 


62.7% 


T8R1Q1 


0.2386 


NS 


75.8% 


66.0% 


T5R1Q1 


0.0831 


NS 


70.7% 


59.3% 


T5T+1R1 


0.1332 


NS 


64.1% 


59.9% 


T8T+1R1 


0.1299 


NS 


75.2% 


63.4% 


T5T8R1 


0.0813 


NS 


65.5% 


60.2% 


T5T8Q1 


0.8654 


NS 


9.7% 


7.8% 


T5T8T+1 


0.8546 


NS 


9.6% 


9.3% 


T8T+1Q1 


0.6864 


NS 


10.4% 


9.3% 


T5T+1Q1 


0.8618 


0.9991 


9.7% 


9.7% 



NS = non-significant or over-transmitted allele not present more often in cases than controls; 
NA ss no alleles were over-transmitted 
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TABLE 25 



BHR 


Combined US 
and UK 


Over-Transmitted 
Haplotype 


Exon in Gene 
216 


TDT p-va)ue 


Case/Control 
p-value 


Control Frequency 


Case Frequency 


Q 1 


U.UoUU 


U. I ODO 


on co/ 

oy.o /o 


QA OO/ 

y^.i /o 


R 1 


U.Uo/4 


No 


Rft 70/ 


Rft V>/ 


T +1 


0.1252 


U. 14 I o 


oc no/ 


on fio/ 


T R 


■1 nnnn 
1 .uuuu 


NS 


NA 




I O 


n HQ/17 


UADO 1 


7fi 70/. 


80 2% 


oh r\A 
K I L4 I 


U.UU I f 




7R 9°/, 


83 7% 


1 


U.looo 




OC 0 0/ 
oO,£ /O 


mu.o /o 


T8Q1 


0.1616 


u.uyo ( 


Oft f\0/ 


w I .o /o 


T5Q1 


0.1496 


0.3214 


76.2% 


80.2% 


T+1R1 


0.0015 


0.1479 


73.9% 


80.2% 


T8R1 


0.0281 


0.7994 


85.1% 


85.9% 


T5R1 


0.0009 


0.6419 


66.1% 


68.4% 


T8T+1 


0.6224 


0.1380 


85.2% 


90.6% 


T5T+1 


0.4821 


0.0660 


72.5% 


80.3% 


T5T8 


0.1786 


0.1284 


73.7% 


80.2% 


T+1R1Q1 


0.0003 


0.1426 


73.9% 


80.4% 


T8R1Q1 


0.0035 


0.1298 


74.7% 


81.4% 


T5R1Q1 


0.0001 


0.4524 


65.9% 


69.7% 


T5T+1R1 


0.0052 


0.1332 


62.3% 


69.6% 


T8T+1R1 


0.0066 


0.1397 


73.9% 


80.6% 


T5T8R1 


0.0028 


0.2632 


63.0% 


68.4% 


T5T8Q1 


0.3680 


0.0954 


73.1% 


80.3% 


T5T8T+1 


0.5282 


0.0786 


72.7% 


80.3% 


T8T+1Q1 


0.3105 


0.1261 


85.3% 


90.6% 


T5T+1Q1 


0.5276 


0.0686 


72.5% 


80.3% 



NS = non-significant or over-transmitted allele not present more often in cases than controfs; 
NA = no alleles were over-transmitted 
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TABLE 25 (CONT) 



BHR 


UK 


Over-Transmitted 
Haplotype 


Exon in Gene 
216 


TDT p-value 


Case/Control 
p-va!ue 


Control Frequency 


Case Frequency 


Q 1 


0.0069 


0.0613 


89.4% 


95.8% 


R 1 


0.3285 


0.1041 


86.8% 


93.0% 


T +1 


0.0201 


0.0454 


86.4% 


94.0% 


T 8 • 


1 .0000 


NS 


NA 




T 5 


0.0367 


0.2644 


75.4% 


81 .6% 


R1Q1 


0.00078 


0.0052 


76.2% 


89.8% 


T+1Q1 


0.0209 


0.0280 


86.4% 


94.0% 


T8Q1 


0.0120 


0.0933 


87.4% 


93.8% 


T5Q1 


0.0974 


0.1624 


74.9% 


81.7% 


T+1R1 


0.0001 


0.0026 


73.2% 


87.6% 


T8R1 


0.2818 


0.1182 


84.6% 


91.0% 


T5R1 


0.0038 


0.0420 


63.1% 


74.6% 


T8T+1 


0.1437 


0.0327 


86.4% 


94.0% 


T5T+1 


0.0902 


0.0739 


72.9% 


81 .7% 


T5T8 


0.0536 


0.1052 


73.7% 


81 .7% 


T+1R1Q1 


0.000075 


0.0042 


73.2% 


87.8% 


T8R1Q1 


0.0031 


0.0056 


74.1% 


87.7% 


T5R1Q1 


0.0000078 


0.0331 


63.4% 


75.4% 


T5T+1R1 


0.0071 


0.0131 


61.5% 


75.3% 


T8T+1R1 


0.0023 


0.0034 


73.2% 


87.8% 


T5T8R1 


0.0073 


0.0216 


61.5% 


74.6% 


T5T8Q1 


0.0424 


0.0835 


73.3% 


81.7% 


T5T8T+1 


0.1380 


0.0761 


73.0% 


81.7% 


T8T+1Q1 


0.0322 


0.0319 


86.4% 


94.0% 


T5T+1Q1 


0.1096 


0.0756 


72.9% 


81.7% 



NS = non-significant or over-transmitted allele not present more often in cases than controls; 
NA = no alleles were over-transmitted 
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TABLE 25 (CONT) 

BHR 



US 







Uvcl - 1 raiibiiiiuyu 










ndpiuiypc 






txon in 


l D r p-value 


Case/Control p- 


\_sunirui rrequency 


Case Fi 


oene £. \ o 




value 






r\ a 
vj_ 1 


U.OUO I 


n 79^0 


•i n AO/ 


A O COA 


IX 1 


VJ.UO/ f 


MC 


92 2% 


71 .4% 


T 4-1 


u.o*+yo 


u.oyo r 


17 1 %. 


91 4% 


1 O 


i nnnn 




NA 






n 77Z.1 
U. f /*+ I 




on qo/ 
tU.O /o 


25.0% 


R1D1 


0 1259 




81 .8% 


58.8% 


M IUI 


u. f <t»o 




-in AOL 


91 A°/ n 


1 OU 1 


0 7514 


0 7864 


10 4% 


12.1 % 


1 | 


n moo 


U. 1 tfUO 


9 7% 


1 8.8% 


T+1R1 


0.2012 


NS 


75.1% 


50.0% 


T8R1 


0.0880 


NS 


86.1% 


67,9% 


T5R1 


0.0963 


NS 


71.4% 


46.4% 


T8T+1 


0.7557 


0.2626 


10.7% 


17.9% 


T5T+1 


0.4904 


0.0908 


9.7% 


21.4% 


T5T8 


0.8871 


0.9876 


20.8% 


21.4% 


T+1R1Q1 


0.0828 


NS 


75.2% 


50.0% 


T8R1Q1 


0.1759 


NS 


75.8% 


55.9% 


T5R1Q1 


0.2046 


NS 


70.7% 


46.4% 


T5T+1 R1 


0.1915 


NS 


64.1% 


46.4% 


T8T+1R1 


0.2537 


NS 


75.2% 


50.0% 


T5T8R1 


0.1633 


NS 


65.5% 


46.4% 


T5T8Q1 


0.6920 


0.3863 


9.7% 


16.1% 


T5T8T+1 


0.8586 


0.3158 


9.6% 


17.9% 


T8T+1Q1 


0.7517 


0.3367 


10.4% 


17.9% 


T5T+1Q1 


0.8579 


0.1166 


9.7% 


21.4% 



NS = non-significant or over-transmitted allele not present more often in cases than controls; 
NA = no alleles were over-transmitted 
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EXAMPLE 15: Attributable Risk Assessment 

From the knowledge of the frequency of a functional polymorphism and 
the relative risk of the heterozygote and homozygote (at-risk) genotypes, one 
5 can evaluate the attributable fraction (M.J. Khoury et al., 1993, Fundamentals 
of Genetic Epidemiology, J.L. Kelsy et al., (eds), Monographs in Epidemiology 
and Biostatistics, Oxford University Press, New York, NY, Section 3, pp 74-77) 
or attributable risk in the population. An attributable fraction of 25% would 
mean that if the population were monomorphic for the protective allele, the 
1 0 prevalence of the trait would be 25% lower. 

The formula for the attributable fraction is: 

Attributable fraction - ^ f) [ f) l \ + 1 ■ 

where f is the allele frequency, y is the relative risk of the heterozygote 
genotype over the wild type homozygote, and ri is the risk of the homozygote 
1 5 mutant over the wild type homozygote. This approach requires the estimation 
off, y and tj. ideally these quantities should be estimated in an epidemiological 
sample. 

The study design (genome scan with affected sibling pairs followed by 
association study using IBD = 2 individuals as cases in the case/control 

20 comparison) offers maximum power to detect linkage and association, but does 
not provide estimates of the required parameters, namely 1) the relative risk 
(or odds ratio) of the genotype/all ele for most SNPs or haplotypes and 2) the 
frequency of the SNP in the general population. In a recent paper, Altshuler 
et al. used the data from a TDT analysis to estimate allele and genotype 

25 relative risks assuming a multiplicative model or r\ = y 2 (D. Altshuler et al., 
2000, Nature Genetics 26:76-80). Thus, the mutant homozygote is predicted 
to carry a relative risk equal to the square of the risk for the heterozygote. 

To overcome some of the difficulties mentioned above that are 
associated with a case/control design, the data obtained from typing 5 SNPs 

30 in Gene 216 on the entire population (not just the subset of IBD = 2 individuals) 
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were used to estimate the relative risk of these 5 SNPs. The data from the 
TDT obtained by using the first asthmatic sibling per family were used. 
Because of the limited number of informative matings in the TDT analysis, a 
multiplicative model for the genotype relative risk was used as in the Altshuler 
5 et. al paper, i.e. r\ = y 2 . An interval on the attributable fraction estimates was 
made by constructing individual confidence regions for the allele frequency in 
the control population and for the attributable risk obtained from the TDT data. 
While combining these two confidence intervals to obtain a confidence region 
for the attributable fraction did not lead to a proper confidence region with the 
10 required coverage, it determined the variability involved in estimating the 
attributable fraction. As a short hand notation, this is referred to as a 
confidence interval with coverage equal to the one used for the constituent 
parameters. 

By using the control population to estimate allele frequencies, the 
15 attributable risk was underestimated. Based on these assumptions, the 
attributable risk for the single SNPs that were significant in the case-control 
study (p < 0.05) in either population was computed. The AF was also 
computed for all SNP combinations significant in the combined TDT analysis 
(p < 0.01) using the asthma phenotype. These values are shown below. 



SNPfs) 


Attributable fraction (AF) estimate 80% Confidence Interval 


Q 1 


50% 


17 to 65% 


R 1 


37% 


4 to 57% 


T + 1 


39% 


7 to 57% 


T5 


22% 


0 to 35% 


R1 Q1 


36% 


14 to 54% 


T+1 R1 


29% 


8 to 47% 


T +1 R1 Q1 


. 34% 


14 to 52% 


T 5 R1 Q1 


19% 


3 to 38% 


T5T8 R1 


24% 


9 to 41% 


T 8 R1 Q1 


32% 


11 to 50% 


T 8 T+1 R1 


25% 


2 to 44% 
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Because the alleles that confer increased risk of developing asthma are 
so common (haplotype frequencies ranging from 60% to 83%), their effect 
translated into a substantial population attributable risk, with estimates ranging 
from 19 to 50% for different SNPs or SNP haplotypes. These computations 
5 depended heavily on allele frequency and risk estimates. Proper estimates of 
the attributable fraction are based on a population sample and are only 
meaningful for functional SNPs or SNP haplotypes. 

Conclusion : Gene 216 has been demonstrated to be an asthma gene 
in accordance with the data disclosed herein, including: 1) localization to a 

10 region on chromosome 20 identified through linkage; 2) polymorphism analysis 
performed to identify sequence variants localized in the candidate gene; 3) 
genotype analyses of the identified polymorphisms; 4) association between 
identified alleles and the asthma phenotype in a case-control analysis; 5) 
association between identified alleles and the asthma phenotype in 

15 transmission disequilibrium tests (TDT), haplotype analyses, and analyses 
using additional phenotypes; 6) identification of transcripts in tissues relevant 
to pulmonary disease and/or inflammation; and 7) characterization of Gene 
216 as an ADAM family member. In addition to respiratory diseases, Gene 
216 is likely to be involved in obesity and inflammatory bowel disease, as 

20 obesity (Wilson et al„ 1999, Arch. Intern. Med. 159: 2513-14) and inflammatory 
bowel disease (B. Wallaert et al. t 1995, J. Exp. Med. 182:1897-1904) have 
been linked to asthma. 

EXAMPLE 16: Protein Expression And Purification 

Expression and purification of the Gene 216 protein of the invention can 
25 be performed essentially as outlined below. To facilitate the cloning, 
expression, and purification of membrane and secreted protein from the 
20p13-p12, a gene expression system, such as the pET System (Novagen), 
for cloning and expression of recombinant proteins in E. coJi is selected. Also, 
a DNA sequence encoding a peptide tag, the His-Tap, is fused to the 3* end of 
30 DNA sequences of interest to facilitate purification of the recombinant protein 
products. The 3' end is selected for fusion to avoid alteration of any 5 f terminal 
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signal sequence. 

Nucleic acids chosen, for example, from the nucleic acids set forth in 
SEQ ID NO:1 or SEQ ID NO:6 (Figures 24 and 29, respectively) for cloning the 
genes are prepared by polymerase chain reaction (PCR). Synthetic 
5 oligonucleotide primers specific for the 5 r and 3' ends of the nucleotide 
sequences are designed and purchased from Life Technologies. All forward 
primers (specific for the 5" end of the sequence) are designed to include an 
Nco\ cloning site at the 5' terminus. These primers are designed to permit 
initiation of protein translation at the methionine residue encoded within the 

10 A/col site followed by a valine residue and the protein encoded by the DNA 
sequence. All reverse primers (specific for the 3' end of the sequence) include 
an EcoRI site at the 5' terminus to permit cloning of the sequence into the 
reading frame of the pET-28b. The pET-28b vector provides a sequence 
encoding an additional 20 carboxyl-terminal amino acids including six histidine 

1 5 residues (at the C-terminus), which comprise the histidine affinity tag. 

DNA prepared from the 20p13-p12 region is used as the source of 
template DNA for PCR amplification (Ausubel et al., 1994). To amplify a DNA 
sequence containing the nucleotide sequence, c DNA (50 ng) is introduced into 
a reaction vial containing 2 mM MgCI 2 , 1 u.M synthetic oligonucleotide primers 

20 (forward and reverse primers) complementary to and flanking a defined 20p13- 
p12 region, 0.2 mM of each of deoxynucleotide triphosphate, dATP, dGTP, 
dCTP, dTTP and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche 
Molecular Systems, Inc., Branchburg, NJ) in a final volume of 100 microliters. 
Upon completion of thermal cycling reactions, each sample of amplified 

25 DNA is purified using the Qiaquick Spin PCR purification kit. All amplified DNA 
samples are subjected to digestion with the restriction endonucleases, e.g., 
A/col and EcoRI (NEB) (Ausubel et al., 1994). DNA samples are then 
subjected to electrophoresis on 1.0% NuSeive (FMC BioProducts) agarose 
gels. DNA is visualized by exposure to ethidium bromide and long wave UV 

30 irradiation. DNA contained in slices isolated from the agarose gel was purified 
using the BIO 101 GeneClean Kit protocol. 
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The pET-28b vector is prepared for cloning by digestion with restriction 
endonucleases, e.g., Nco\ and EcoRl (NEB) (Ausubel et al., 1994). The pET- 
28a vector, which encodes the histidine affinity tag that can be fused to the 5* 
end of an inserted gene, is prepared by digestion with appropriate restriction 
5 endonucleases. 

Following digestion, DNA inserts are cloned (Ausubel et al., 1994) into 
the previously digested pET-28b expression vector. Products of the ligation 
reaction are then used to transform the BL21 strain of E. colt (Ausubel et al., 
1994) as described below. 

10 Competent bacteria, E. coli strain BL21 or E. coll strain BL21 (DE3), are 

transformed with recombinant pET expression plasmids carrying the cloned 
sequence according to standard methods (Ausubel et al., 1994). Briefly, 1 
microliter of ligation reaction is mixed with 50 microliters of electrocompetent 
cells and subjected to a high voltage pulse, after which samples were 

1 5 incubated in 0.45 ml SOC medium (0.5% yeast extract, 2.0% tryptone, 1 0 mM 
NaCI, 2.5 mM KCI, 10 mM MgCI 2 , 10 mM MgSQ 4 and 20 mM glucose) at 37°C 
with shaking for 1 hr. Samples are then spread on LB agar plates containing 
25 ug/ml kanamycin sulfate for growth overnight. Transformed colonies of 
BL21 are then picked and analyzed to evaluate cloned inserts, as described 

20 below. 

Individual BL21 clones transformed with recombinant pET-28b. 20p13- 
p12 region nucleotide sequences are analyzed by PCR amplification of the 
cloned inserts using the same forward and reverse primers specific for the 
20p13-p12 region sequences that are used in the original PCR amplification 

25 cloning reactions. Successful amplification verifies the integration of the 
sequence in the expression vector (Ausubel et al., 1994). 

Individual clones of recombinant pET-28b vectors carrying properly 
cloned 20p13-p12 region nucleotide sequences are picked and incubated in 
5 ml of LB broth plus 25 ug/ml kanamycin sulfate overnight. The following day 

30 plasmid DNA is isolated and purified using the QIAGEN plasmid purification 
protocol. 
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The pET vector can be propagated in any E. cofi K-12 strain, e.g., 
HMS174, HB101, JM109, DH5, and the like, for purposes of cloning or plasmid 
preparation. Hosts for expression include E. coli strains containing a 
chromosomal copy of the gene for T7 RNA polymerase. These hosts are 
5 lysogens of bacteriophage DE3, a lambda derivative that carries the lacl gene, 
the lacUV5 promoter, and the gene for T7 RNA polymerase. T7 RNA 
polymerase is induced by addition of isopropy!-0-D-thiogalactoside (IPTG), and 
the T7 RNA polymerase transcribes any target plasmid containing a functional 
T7 promoter, such as pET-28b, carrying its gene of interest. Strains include, 

10 for example, BL21(DE3) (Studier et al., 1990, Meth. EnzymoL, 185:60-89). 

To express the recombinant sequence, 50 ng of plasmid DNA are 
isolated as described above to transform competent BL21(DE3) bacteria as 
described above (provided by Novagen as part of the pET expression kit). The 
lacZ gene (p-galactosidase) is expressed in the pET-System as described for 

15 the 20p13-p12 region recombinant constructions. Transformed cells were 
cultured in SOC medium for 1 hr, and the culture is then plated on LB plates 
containing 25 ug/ml kanamycin sulfate. The following day, the bacterial 
colonies are pooled and grown in LB medium containing kanamycin sulfate (25 
ug/ml) to an optical density at 600 nM of 0.5 to 1 .0 OD units, at which point 1 

20 mM IPTG was added to the culture for 3 hr to induce gene expression of the 
20p13-p12 region recombinant DNA constructions. 

After induction of gene expression with IPTG, bacteria are collected by 
centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 min at 4°C. 
Pellets are resuspended in 50 ml of cold mM Tris-HCl, pH 8.0, 0.1 M NaCI and 

25 0.1 mM EDTA (STE buffer). Cells are then centrifuged at 2000 x g for 20 min 
at 4°C. Wet pellets are weighed and frozen at -80^C until ready for protein 
purification. 

A variety of methodologies known in the art can be used to purify the 
isolated proteins (Coligan et al., 1995, Current Protocols in Protein Science, 
30 John Wiley & Sons, New York, NY). For example, the frozen cells can be 
thawed, resuspended in buffer, and ruptured by several passages through a 
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small volume microfluidizer (Model M-1 10S, Microfluidics International Corp., 
Newton, MA). The resultant homogenate is centrifuged to yield a clear 
supernatant (crude extract) and, following filtration, the crude extract is 
fractioned over columns. Fractions are monitored by absorbance at OD 2 8o nm 
5 and peak fractions may be analyzed by SDS-PAGE. 

The concentrations of purified protein preparations are quantified 
spectrophotometrically using absorbance coefficients calculated from amino 
acid content (Perkins, 1986, Eur. J. Biochem., 157:169-180). Protein 
concentrations are also measured by the method of Bradford, 1976, Anal. 

10 Biochem., 72:248-254; and Lowry et al., 1951, J. Biol. Chem., 193:265-275 
using bovine serum albumin as a standard. 

SDS-polyacrylamide gels of various concentrations are purchased from 
Bio-Rad, and stained with Coomassie blue. Molecular weight markers may 
include rabbit skeletal muscle myosin (200 kDa), E. coli p-galactosidase (116 

15 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum albumin (66.2 
kDa), ovalbumin (45 kDa), bovine carbonic anyhdrase (31 kDa), soybean 
trypsin inhibitor (21 .5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin 
(6.5 kDa). 

Proteins can also be isolated by other conventional means of protein 
20 biochemistry and purification to obtain a substantially pure product, i.e., 80, 95, 
or 99% free of cell component contaminants, as described in Jacoby, 1984, 
Methods in Enzymology, Vol. 104, Academic Press, NY; Scoopes, 1987, 
Protein Purification, Principles and Practice, 2 nd Ed., Springer-Verlag, NY; and 
Deutscher (ed), 1990, Guide to Protein Purification, Methods in Enzymology, 
25 Vol. 182. If the protein is secreted, it can be isolated from the supernatant in 
which the host cell is grown; otherwise, it can be isolated from a lysate of the 
host cells. 

Once a sufficient quantity of the desired protein has been obtained, it 
may be used for various purposes. One use of the protein or polypeptide is the 
30 production of antibodies specific for binding. These antibodies may be either 
polyclonal or monoclonal, and may be produced by in vitro or in vivo 
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techniques well known in the art. Monoclonal antibodies to epitopes of any of 
the peptides identified and isolated as described can be prepared from murine 
hybridomas (Kohler, 1975, Nature, 256:495). In summary, a mouse is 
inoculated with a few micrograms of protein over a period of 2 weeks. The 
5 mouse is then sacrificed. The cells that produce antibodies are then removed 
from the mouse's spleen. The spleen cells are then fused with polyethylene 
glycol with mouse myeloma cells. The successfully fused cells are diluted in 
a microtiter plate and growth of the culture is continued. The amount of 
antibody per well is measured by immunoassay methods such as ELISA 

10 (Engvall, 1980, Meth. EnzymoL, 70:419). Clones producing antibody can be 
expanded and further propagated to produce protein antibodies. Other 
suitable techniques involve in vitro exposure of lymphocytes to the antigenic 
polypeptides, or alternatively, to selection of libraries of antibodies in phage or 
similar vectors. See Huse et al., 1989, Science, 246:1275-1281. For 

15 additional information on antibody production see Davis et al., 1989, Basic 
Methods in Molecular Biology, Elsevier, NY, Section 21-2. Such antibodies are 
particularly useful in diagnostic assays for detection of variant protein forms, 
or as an active ingredient in a pharmaceutical composition. 

20 The disclosure of each of the patents, patent applications, and 

publications cited in the specification is hereby incorporated by reference 
herein in its entirety. 

Although the invention has been set forth in detail, one skilled in the art 
25 will recognize that numerous changes and modifications can be made, and that 
such changes and modifications may be made without departing from the spirit 
and scope of the invention. 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid comprising a nucleotide sequence 
selected from the group consisting of: 
5 a. SEQIDNO:1; 

b. a nucleotide sequence encoding amino acid SEQ ID NO:4; 

c. a nucleotide sequence complementary to SEQ ID NO:1; 

d. a nucleotide sequence which hybridizes under high stringency 
conditions to SEQ ID NO:1 ; 

10 e. a nucleotide sequence which hybridizes under moderate stringency 

conditions to SEQ ID NO:1 ; 

f. a nucleotide sequence which hybridizes under low stringency 
conditions to SEQ ID NO:1; 

g. a nucleotide sequence which is at least 95% identical to the 
15 sequence of SEQ ID NO:1; 

h. a nucleotide sequence which is at least 80% identical to the 
sequence of SEQ ID NO:1; and 

i. a nucleotide sequence which is at least 50% identical to the sequence 
ofSEQIDNO:1. 

20 2. The isolated nucleic acid of claim 1 which is DNA. 

3. The isolated nucleic acid of claim 1 which is RNA. 

4. A vector comprising the isolated nucleic acid of claim 1 . 

5. A host cell comprising the expression vector of claim 4. 

6. The host cell of claim 5 which is selected from the group 
25 consisting of eukaryotic and prokaryotic cells. 

7. The host cell of claim 5 which is selected from the group 
consisting of bacterial, fungal. 

8. The isolated nucleic acid of claim 1, wherein the nucleic acid 

- 198- 



WO 01/78894 



PCT/US01/12245 



sequence comprises at least 50 consecutive nucleotides. 

9. A vector comprising the isolated nucleic acid of claim 8. 

10. A host cell comprising the expression vector of claim 9. 

11. The host cell of claim 10 which is selected from the group 
5 consisting of eukaryotic and prokaryotic cells. 

12. The host cell of claim 10 which is selected from the group 
consisting of bacterial, yeast, insect, mammalian, and plant cells. 

13. The isolated nucleic acid of claim 1, wherein the nucleic acid 
sequence comprises at least 1 5 consecutive nucleotides. 

10 14. A vector comprising the isolated nucleic acid of claim 1 3. 

15. A host cell comprising the vector of claim 1 4. 

16. The host cell of claim 15 which is selected from the group 
consisting of eukaryotic and prokaryotic cells. 

17. The host cell of claim 15 which is selected from the group 
15 consisting of bacterial, yeast, insect, mammalian, and plant cells. 

1 8. An isolated nucleic acid variant which comprises the sequence 
of SEQ ID NO:6, and contains at least one single nucleotide polymorphism set 
forth in Table 10. 

19. An isolated nucleic acid variant which comprises at least 50 
20 consecutive nucleotides of SEQ ID NO:6, and contains at feast one single 

nucleotide polymorphism set forth in Table 10. 

20. An isolated nucleic acid variant which comprises at least 15 
consecutive nucleotides of SEQ ID NO:6, and contains at least one single 
nucleotide polymorphism set forth in Table 10. 

25 21 . The isolated nucleic acid variant of claim 20, wherein the single 

nucleotide polymorphism is selected from the group consisting of T4, T5, T8, 
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T+1, T+2, R1, Q1, Q2, QR+4, QR+6, QR+7, and U-1. 

22. The isolated nucleic acid variant of claim 20, wherein the single 
nucleotide polymorphism selected from the group consisting of D1, F1, 11, L1, 
R2, T6, T1 , T2, T3, and 17. 

5 23. The isolated nucleic acid variant of claim 20 containing at least 

two single nucleotide polymorphisms selected from the group consisting of: 

a. T+2 and QR+4; 

b. QR+5 and QR+4; 

c. QR+4 and Q+1; 

1 0 d. QR+6 and Q2; and 

e. QR+4 and Q2. 

24. The isolated nucleic acid variant of claim 20, wherein the single 
nucleotide polymorphism is selected from the group consisting of: 

a. T5 and T8; 
15 b. T+2 and QR+4; 

C T4 and T5. 

d. T+1 and R1 and Q1; and 

e. T5 and R1 and Q1. 

25. An isolated nucleic acid variant which comprises the sequence 
20 of SEQ ID NO:1 , and contains at least one single nucleotide polymorphism at 

a site shown in Figure 24. 

26. An isolated nucleic acid variant which comprises at least 50 
consecutive nucleotides of SEQ ID NO:1, and contains at least one single 
nucleotide polymorphism at a site shown in Figure 24. 

25 27. An isolated nucleic acid variant which comprises at least 15 

consecutive nucleotides of SEQ ID NO:1, and contains at least one single 
nucleotide polymorphism at a site shown in Figure 24. 

28. An isolated alternate splice variant which comprises at least one 
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exon of SEQ ID IMO:1 set forth in Figures 9 and 10. 

29. An isolated alternate splice variant which comprises at least one 
exon of SEQ ID NO:1 selected from the group consisting of exons T, R, Q, and 
U set forth in Figures 9 and 10. 

5 30. An isolated alternate splice variant which comprises at least one 

exon of SEQ ID NO:1 selected from the group consisting of exons A, B, C, D, 
D\ E, F, G, H, I, J, K, L f L2, M, N, O, P, and S set forth in Figures 9 and 10. 

31 . An isolated alternate splice variant which comprises a sequence 
selected from the group consisting of SEQ ID NO:2 and SEQ ID NO:350-362. 

10 32. An isolated polypeptide encoded by the nucleic acid of any one 

of claims 1 and 8. 

33. An isolated polypeptide encoded by the nucleic acid of any one 
of claims 18, 19, 25, and 26. 

34. An isolated polypeptide comprising an amino acid sequence 
15 selected from the group consisting of: 

a. SEQ ID NO:4; 

b. an amino acid sequence which is at least 80% identical to SEQ 
ID NO:4; 

c. an amino acid sequence which is at least 75% identical to SEQ 
20 ID NO:4; and 

d. an amino acid sequence which is at least 65% identical to SEQ 
ID NO:4. 

35. An isolated polypeptide comprising ,atr least 20 consecutive 
residues of the amino acid sequence of claim 34. 

25 36. An isolated polypeptide comprising at least 7 consecutive 

residues of the amino acid sequence of claim 34. 

37. An antibody or antibody fragment which binds to the isolated 
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polypeptide of claim 32. 

38. An antibody or antibody fragment which binds to the isolated 
polypeptide of claim 33. 

39. An antibody or antibody fragment which binds to the isolated 
5 polypeptide according to any one of claims 34-36 

40. The antibody or antibody fragment of claim 37 which is selected 
from the group consisting of polyclonal and monoclonal antibodies. 

41 . The antibody or antibody fragment of claim 38 which is selected 
from the group consisting of polyclonal and monoclonal antibodies. 

1 0 42. The antibody or antibody fragment of claim 39 which is selected 

from the group consisting of polyclonal and monoclonal antibodies. 

43. An isolated nucleic acid comprising a nucleotide sequence 
sefected from the group consisting of: 

a. SEQIDNO:6; 
15 b. a nucleotide sequence comprising at least 50 consecutive 

nucleotides of SEQ ID NO:6; and 

c. a nucleotide sequence comprising at least 15 consecutive 
nucleotides of SEQ ID NO:6. 

44. An isolated nucleic acid comprising a nucleotide sequence 
20 selected from the group consisting of: 

a. SEQ ID NO:364; 

b. a nucleotide sequence complementary to SEQ ID NO:364. 

c. a nucleotide sequence comprising at least 50 consecutive 
nucleotides of SEQ ID NO:364; 

25 d. a nucleotide sequence comprising at least 15 consecutive 

nucleotides of SEQ ID NO:364. 

e. SEQ ID NO:365; 

f. a nucleotide sequence complementary to SEQ ID NO:365; 



-202- 



WO 01/78894 



PCT/US01/12245 



g. a nucleotide sequence comprising at least 50 consecutive 
nucleotides of SEQ ID NO:365; and 

h. a nucleotide sequence comprising at least 15 consecutive 
nucleotides of SEQ ID NO:365. 

5 45. An isolated polypeptide comprising an amino acid sequence 

selected from the group consisting of: 

a. SEQ ID NO:366; 

b. an amino acid sequence comprising 20 consecutive residues of 
SEQ ID NO:366; and 

10 c. an amino acid sequence comprising 7 consecutive residues of 

SEQ ID NO:M366. 

46. An isolated antibody or antibody fragment that binds to the 
isolated polypeptide of claim 45. 

47. The antibody or antibody fragment of claim 46 which is selected 
1 5 from the group consisting of monoclonal and polyclonal antibodies. 

48. An isolated antisense nucleic acid comprising at least 15 
consecutive nucleotides of a sequence complementary to SEQ ID NO:1 . 

49. An isolated antisense nucleic acid comprising at least 15 
consecutive nucleotides of a sequence complementary to SEQ ID NO:6. 

20 50. A vector comprising the isolated antisense nucleic acid of any 

one of claims 48-49. 

51 . A kit for detecting a Gene 21 6 nucleotide sequence comprising: 

a. the isolated nucleic acid of any one of claims 13, 20, and 27; and 

b. at least one component to detect binding of the isolated nucleic 
25 acid to a Gene 216 nucleotide sequence. 

52. A kit for detecting a Gene 216 amino acid sequence comprising: 

a. the isolated antibody of claim 42; and 

b. at least one component to detect binding of the isolated antibody 
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to a Gene 216 amino acid sequence. 

53. A method of identifying a Gene 216 ligand, comprising: 

a. contacting the isolated polypeptide of claim 35 with a test agent 
under conditions that allow the polypeptide to bind to the test agent, and 

5 thereby form a complex; and 

b. detecting the poiypeptide-test agent complex of (a), wherein 
detection of the complex indicates identification of a Gene 216 ligand. 

54. The method of claim 53, wherein the ligand is a metalloprotease 
inhibitor. 

10 55. The method of claim 54, wherein the metalloprotease inhibitor is 

a proglutamyl peptide analog. 

56. The method of claim 55, wherein the proglutamyl peptide analog 
is an analog of pyroGlu-Asn-Trp-OH or pyroGlu-Glu-Trp-OH. 

57. A pharmaceutical composition comprising the ligand identified 
15 according to the method of any one of claims 53-56, and a physiologically 

acceptable carrier, excipient, or diluent. 

58. A pharmaceutical composition comprising the isolated nucleic 
acid of any one of claims 1, 8, 13, 43, 48, and 49, and a physiologically 
acceptable carrier, excipient, or diluent. 

20 59. A pharmaceutical composition comprising the vector of any one 

of claims 4, 9, 14, and 48, and a physiologically acceptable carrier, excipient, 
or diluent. 

60. A pharmaceutical composition comprising the isolated antibody 
or antibody fragment of claim 42, and a physiologically acceptable carrier, 

25 excipient, or diluent. 

61. A pharmaceutical composition comprising the isolated 
polypeptide of claim 36 and a physiologically acceptable carrier, excipient, or 
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diluent. 

62. A method of identifying a human Gene 216 or ortholog, 
comprising: 

a. contacting the nucleic acid of any one of claims 1 , 8, and 13 with 
5 a biological sample under conditions that allow the nucleic acid to hybridize to 

a nucleic acid in the sample, and thereby form a complex; and 

b. detecting the hybridization complex of (a), wherein detection of 
the complex indicates identification of a human Gene 216 or ortholog. 

63. A method of treating a chromosome 20 disorder comprising 
10 administering the pharmaceutical composition of claim 57 in an amount 

effective to treat the disorder. 

64. The method of claim 63, wherein the chromosome 20 disorder 
is selected from the group consisting of asthma, obesity, and inflammatory 
bowel disease. 

15 65. A method of treating a chromosome 20 disorder comprising 

administering the pharmaceutical composition of claim 58 in an amount 
effective to treat the disorder. 

66. The method of claim 65, wherein the chromosome 20 disorder 
is selected from the group consisting of asthma, obesity, and inflammatory 

20 bowel disease. 

67. A method of treating a chromosome 20 disorder comprising 
administering the pharmaceutical composition of claim 59 in an amount 
effective to treat the disorder. 

68. The method of claim 67, wherein the chromosome 20 disorder 
25 is selected from the group consisting of asthma, obesity, and inflammatory 

bowel disease. 

69. A method of treating a chromosome 20 disorder comprising 
administering the pharmaceutical composition of claim 60 in an amount 
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effective to treat the disorder. 

70. The method of claim 69, wherein the chromosome 20 disorder 
is selected from the group consisting of asthma, obesity, and inflammatory 
bowel disease. 

5 71. A method of treating a chromosome 20 disorder comprising 

administering the pharmaceutical composition of claim 61 in an amount 
effective to treat the disorder. 

72. The method of claim 71 , wherein the chromosome 20 disorder 
is selected from the group consisting of asthma, obesity, and inflammatory 

10 bowel disease. 

73. A transgenic mouse whose genome comprises an introduced null 
mutation in an endogenous Gene 216. 

74. The transgenic mouse of claim 73, wherein both alleles of the 
endogenous Gene 216 of said mouse have been disrupted. 

15 75. The transgenic mouse of claim 74, wherein the mouse genome 

further comprises a human Gene 216 nucleic acid sequence. 

76. A method of making a homozygous transgenic knockout mouse 
comprising: 

a. disrupting an endogenous Gene 216 in mouse embryonic stem 

20 cells; 

b. introducing said embryonic stem cells into a mouse blastocyst 
and transplanting said blastocyst into a pseudopregnant mouse; 

c. allowing said blastocyst to develop into a chimeric mouse; 

d. breeding said chimeric mouse to produce offspring; and 

25 e. screening said offspring to identify a homozygous transgenic 

knockout mouse. 

77. A method of making a knockout mouse comprising administering 
the antibody or antibody fragment of claim 47 in an amount effective to disrupt 
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endogenous Gene 216 polypeptide function, thereby making a knockout 
mouse. 

78. A method of forming a crystal of the isolated Gene 216 
polypeptide of claim 36 comprising: 
5 a. incubating the polypeptide with a solution selected from the 

group consisting of the solutions in wells 1-30 in Table 1 under conditions to 
allow crystal ization; and 

b. detecting the crystalization in (a), whereby crystalization indicates 
formation of a Gene 216 polypeptide crystal. 

10 79. A method of diagnosing a chromosome 20 disorder, comprising: 

a. contacting the isolated nucleic acid of any one of claims 20-24 
with a biological sample under high stringency conditions that allow the nucleic 
acid to hybridize to a nucleic acid in the sample, and thereby form a complex; 
and 

15 b. detecting the hybridization complex of (a), wherein detection of 

the complex indicates diagnosis of a chromosome disorder. 

80. The method of claim 79, wherein the disorder is selected from 
the group consisting of asthma, obesity, and inflammatory bowel disease. 

81 . A method of diagnosing a chromosome 20 disorder comprising: 
20 a. contacting the isolated antibody or antibody fragment of claim 41 

with a biological sample under high stringency conditions that allow the 
antibody or antibody fragment to bind to an amino acid sequence in the 
sample, and thereby form a complex; and 

b. detecting the complex of (a), wherein detection of the complex 
25 indicates diagnosis of a chromosome disorder. 

82. A method of determining a pharmacogenetic profile comprising: 
a. contacting the isolated nucleic acid of any one of claims 20-24 

with a biological sample under high stringency conditions that allow the nucleic 
acid to hybridize to a nucleic acid in the sample, and thereby form a complex; 
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and 

b. detecting the hybridization complex of (a), wherein detection of 
the complex determines the pharmacogenetic profile. 

83. A method of determining a pharmacogenetic profile comprising: 
5 a. contacting the isolated antibody of claim 41 with a biological 

sample under high stringency conditions that allow the antibody to hybridize to 
an amino acid sequence in the sample, and thereby form a complex; and 

b. detecting the complex of (a), wherein detection of the complex 
determines the pharmacogenetic profile. 

10 84. A eel! line comprising the isolated nucleic acid of any one of 

claims 8, 19,26, and 28. 

85. A biochip comprising the isolated nucleic acid of any one of 
claims 8, 19, 26, and 28. 
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>BAC1098L22 sequence 

gctctaataaatttgcggccgctaatacgactcactatagggagaggatccgcggaattcccccatgtgccatgtccacg 
agatggttgaacagatgagacaacatggttgtgcagcttctctctttttttttttttcagacagagtctcactctgtagc 
ccaggctggagtgcagtggcgcaatttcagctcactgcaacctccgcctcccaggtcctgattcaagcagttctcctgcc 
tcagcctcctgagtagctgggattacaggaacgcgccactatgcccagctaatttttctaggtttagtagagacggggtt 
tcaccatgttgaccaggctggtcttgaattcctgaccttgtgatccgcccgcctcggcctcccgaagtgctgagattaca 
ggcatgagccaccgcatccggccatgcagcttctcttttctagagttaacaggagatgcggcaggtctgtagtagcccag 
aaattctcagacttgcactcttgaatttacacgcatgtacaaaatgcagctcagcctaatgcctcctgttcagttctgtg 
gtgctgcctacttcctgtgcccaaattagccttgcgtttataactaggaggaatgacttggtgtcctacaatattttggc 
tcctgggcttctttggtgttcattctattactagttgattttttttttcttggaaagaaaagtattcaaagagccagaat 
ttactttcattatatcttacaghtgttaaattaacctatccttgttttcactttctgtgtacttctttctttggggtaca 
aaacagtgtttctttagattgtctatttctaaatattattttaaccagaacataccaaatatattttcctgccagtttca 
ttcttcattttacttcttaaccattgttacgattttttttttaacttcggtctcagattttgctcaattgaagttgtccc 
agttcatgagattttgttttctttgcagctcttgaccatgacagatgtgaccagcacacagattgttacagctgcacagc 
caacaccaatgactgccactggtgcaatgaccattgtgtccccaggaaccacagctgctcagaaggccaggtcagaggct 
gtttcttaaagatttcagaaaaatcccaatttgtcataggtttagcttttatagtgtatatggtataaataatggcccag 
agttacttttcaaatgggtttctatttggattttattatccctgaggttttcctttaggaagagatgttctgtatatttc 
aagggtcctactagccctgggaatttgtgtattgtgttttagaagaaggtaggactgttgtccctgggcatggctgtgac 
taagcactggatccttggtttgaggcatctgatagtgacctcactttaccagtaccaggttttgataaagttagggtttg 
agagtgagtcaggtgtacagcggcccatattgaactgtggaaatgacagactttgcaaaatctccttgttttatatcttg 
gtttggcataatctcactgctcatcatatatagatttttaaatttaagttataagatgacccaggcttttatagtttttg 
acagcttacaagacttttttttttgtcagtctcatacagttcataaaatagaaaactttcagacttttgaagtgagcatt 
tgaaaagcaccaagttcaacattcccattttacagatgagcaggttgaggcatggtgtttaaaagagtgggctgcttcct 
tgccagttaaaagcatgtccttgagcagaacattccattgcagtgttccccactgcagagctactgcaacttatctatct 
atctgtctgtctgtctgtctatctatctatctatctatctatctatctatctatctatctgtctatctatttatttgcag 
tttgcttcctcaattagattttgttgtctataatatcctgttggtcctcaccataagttctctctgtggatacaaaagaa 
aagcttttgtccactgtgttctagatctccatttttaggtatgagaattgccccaaggataaccccatgtactactgtaa 
caagaagaccagctgcaggagctgtgccctggaccagaactgccagtgggagccccggaatcaggagtgcattgccctgc 
ccggtaggccttgcagggtcatcttggtgtgtgtgggtccattacttcagcctgcttcccccaacactgtgcagcctaag 
ttgaacctagcagaggggaagagctaattctgtccattcatcccccacacgagtattatgggcttttttgtttttaacta 
aaatacagttcttaagtatttgttcctactgtcctttgaaataaagtgaaacatcctttgctgctctgtagaattgagtg 
acagcttggttctgtatcactgagcctgctccttgtctctttcccatccacctttgataacctgggactagaccatgatg 
tctcagcagtcagtctgctgagcactttatggagagtacttcttattaaccactgggatttaattgttggcacctgctaa 
' tgggccttctctgagaaggagaggatagatacttctgtcagcagcaccttttaggggtgatctccagccctgaaaacctc 
aatatatcctgcttctgaggttcaggatgatgaactcagggcctgagaccagccagccatgtgatatatttggaccaggg 
tggtccagaaaggggaacttccttgtctgtgcacaacatctgagcttgttcagggaagttggtttggaccaggccctttt 
tgaatgtcccttggaagtttttgattcagcttgcagagtgggactcttttctatatctcagtgtgtctcaaattttaagg 
tacaggaaaagacctgggaatcttgcgaaaggcatattctggtgcaagtgtggggtggggcctagagtgtgcattttatc 
tagtcgactgctacactgtgaggagcaagtctttgtctgttcttaaatgatctctttcccatggtacctttcttttatct 
cagtgactgttactgttaatgaacattgttgatgtctccaaagtactttggttctggtgaagttgctttgttctttcatt 
gtcttccaggaagcattcatagcttctgtgcagtaccttgtgtgggttcaggatgatcacaggtagcagattacaagctt 
gtcttgtatgctatagccatatcacttgggttgtttctcaagaaggaccttctcaccttgcttttgggatgctttgtaca 
cttgattgtaccttccacctgatgatatgaaaacagtgcagcttttggagactatagatttgttaattccttgattcatt 
tccattcttgcagtttttaccccagccctccaatatgcatattcatttgtctgctcttcacttaggattttagttttcta 
attgttcttcagaaggaagtgtaccagtctaatattggcaccaaactggtgttttcatctaagacataggataagtgacc 
tcagaatatgctttttaggatccgggagatatcaccagtaaacattttaaaattcttgtattctgcatttggtccttaat 
aatgtgtcagaggctcccacatcctaatgaagtacctagaatttaaattagaaaggccatttcggtattcagtaatttga 
actcataatacagtagttttgtctgatttctaaaattcttttctttctcttttccccttaatgaaagaaaatatctgtgg 
cattggctggcatttggttggaaactcatgtttgaaaattactactgccaaggagaattatgacaatgctaaattgttct 
• gtaggaaccacaatgcccttttggcttctcttacaacccagaagaaggtagaatttgtccttaagcagctgcgaataatg 
cagtcatctcagagcatggtgagttaaaatcctcaaaacttaagtttctggttatccacctttctaccaagggcatgact 
gcagcttgcatgtggaaggctgtggatatgtgtaacgtgcttggcaagaaggggagtgctggtgaacgcagcctgaggga 
ctgtgggtttgtgctgtcagagtctcttcctcttaaaatttttaatactttgtatatataagatctatgaataattatat 
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gggggatgaattgtaacatgtatatgtgtacataatctggtgacatcagtagattatttcatacctgttttacctctgga 

ttctgctaggggagaaagagaggtcactgataattagctaggttggattaagccacctgagttccttggagttaaggtat 

tataatagtgcataagactgtataattaccactaagaagtgtacatctcagctggatgtggtggctcacacctgtaatcc 

cagcactttgggaggctgaggtgggtggatcgcctgaggtcaggagttcaagaccagcctgaccaatatagtaaaacccc 

gtctctactaaaaatacaaaaattagccaggcatggtggtgtgcgcctgtaatcccagctactcaggaggccgaggcagg 

agaattgcttgaacccaggaggtggaggtggcagtgagctgagattgcgccaccgcactccagcctgggtgacagagcga 

tactccgtctcaaaacaaaaaaagaacagcaaaaaaaagaaatgtacatctccttgggtctttcatagcccagccatctc 

aaaagaagagagcaccttcttgtcaagagtttctaagcccagaaaggctcaagttctctgcttgtccacccagtgctctc 

agggggcttatagtcaatattccatgatcacattttgtcatttttagtctggagtcataaattgtgatcctaccgtcagt 

taagtagactcatagaacaaagctctttcagcagtttcagctgtggtacagaaacgtttagtggaaatgttcttaccaag 

cagggaggagttgagggcaacacttccttgggtacagcctccttcatgtgaaggtatggaaatgtggcctgggtctcggc 

tgcctgtggcctctgtgtaccacctatagggccattctgagactggtaggaggtgccctgtatttagttttctccaatta 

gtcccttttttcagtgcaacttagatggggtatggacactcaaacattggtgacatattcttagtgtgtttacctcaggc 

tactgtgaccacatttggtatttcataatattttgatagctttttcagatttcagaatctccattggtgactgtctctgt 

tgtttctctttccatgtccaaatgtgggtctcttccagcattccatccttggctggcagctgacctttcccatagttatt 

cactccctcagaaaatggatggcacccagctttgcttattaccctggtgtctctaacaatgactctcgagtccacggaag 

ttaaaagggttcaaccaggtggccacagatatacttctggtaccctttctctcttccttaggttttctaactctaaacgt 

ttctgggtattctaatctgctgtggccacgtttatgaaacagaaattcacagtcttaatgataagactgacagatgagag 

acaactgaagtgtaatgtccttccacagctatacctctagatgtagcccagttagtagagcccagatttttatggaaaaa 

caagaaaggacacctagcctaacccttaggacagaggctggctggtagaaagctggaaggaggtgacccctgcaggtgag 

aaggagtgaattaggatgtcgaagacggaagggctttctgtgatttaattagtgcccccatctgtgagatgtagagggag 

atgattaagggagtggctctttgagtgagctgcaggtaagtttgcatggttggctgcaggctggatgggaggggattttt 

atagttgagcctcaggaaggaaccaaggccagaccctgcgaggccatggctacaatagtaatggatttgaattgtatcgt 

gaaggcaaagaaattattgaagggcctttaaaaagtatttttaattgtctttctttcttttggatctgtttatttttagg 

tcttttagtaggcatgtgtattttttcctctcaaaaatggaaataggctgggtgtggtggctcctgcctgtaatcccagc 

attttgggaggccagtgagggaggattgcttgagcccaggagttcaagactagcctgggcaacacagggagacctcgtct 

ctacaaaggaagtttttttaaagaattagccgggcatgatggtggcacatacttgtagttccggctacttgggaggctga 

ggcaagagggttgcttgagcccaggggtttgaggctgtagtgagccatgatcatgtccctgcactccagcatgggcaaca 

gagcgagaccctgtctcaaaagaaagcaaagggagggaaatacagtatatcttttgttttataactaccaaaattaggaa 

tacttaccatttcttggctaaactttatattttgatttttaaaacttgttaaaaattgcaatgagaaggaaatttcagga 

gagcagaagacagactgtcccaggtgtcactgtcctattattccctataaaatccagtgccaggatggatgaatggataa 

agcaaatgtggtataagttgaatatcccttatctaaaatgctttggacgaggagtgtttcgaattttagaatatttgcat 

tatactaaccagttaagcatccctaatctggaaatccaaaatgctctagtgaacattttccttcagcatccatcatgttg 

gcactcaaaaagttttagattttggagtgttttggatttcagattagggatactcagcctgtgtttgggggtagccatct 

cttcatatagacatttcagaacttaaatattgctttgctataatttctgtgaatttttgatatattatcttctctgagct 

acatttttatcctttataaaatggccatattgaagtgatgatctatcctaatctaccatggctgagtcaagggataaaga 

ggttttcctgtgtctgtggggtatacttaacttggtggtttttatctagaagcttgttttggtcaagatgttggttatat 

tcaggccaggcatggtggctcatgcctataatctcaacattttgggaggccaaggtgggaggatcacttgagctcaggag 

tttgaaactagccagggcaacatggcaagactccatctccaaaatttaaattaaaaaaagatactatctgtattcatagt 

tgtgtctcttttgcctttagtccaagctcaccttaaccccatgggtcggccttcggaagatcaatgtgtcctactggtgc 

tgggaagatatgtccccatttacaaatagtttactacagtggatgccgtctgagcccagtgatgctggattctgtggaat 

tttatcagaacccagtactcggggactgaaggctgcaacctgcatcaacccactcaatggtagtgtctgtgaaaggcctg 

gtaagttcacaggtgaattaggtggtattcagagtttattgtgagagaaaccataggaggcatagttcattgctgagatg 

tgtgaagtagtcatgaaaacagatgaagtattgatttcaagcatgcaaagaagagtataacccagatttcagaagcagaa 

ggaaatattctgggaccctgaatagttttaattataagcaaaactaaaaataactaacactactcgaagaaactgatatt 

ctaattaacaatgagattgataggtttattgaccagaaaaagtattgagaattgctctgaaaagcaaatttattggtgtt 

ggcagagaaatgctgtggaagaagaaacaaaaaaggaaaataaaaccaaagaaaagattagtaaagcaagcaagtgactg 

cagggacagtgttcagaaaggtagtgtcaacagggagaaaaatgatgagagcagttctgtaagcgaggacagaagcaacc 

cagaactacgcgagagctgcaggagagtactgagcagacagacactcggagtagcttcccagctttcagtctctctgggc 

tacattttggctaactaaaggcaggcccaggggtagctgctgcagcatgaagacaactcaaaataagcctcctcctcctg 

cagagggactcacaagcagtcggtggaattgtgggttattttgggaggctggactccatttgcattgtgtccaattttgg 

gaaagtaatttgtctgaggaattacagaggtataggagagaattcatagcattgtagatttctaaatattgatttctaaa 

cttctctaggggagctgcccagcagcctttcagaattctgaatcctttgttgaacttaatgaatgccgtagaccttcttc 
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cctccaaaattgcaaacatgagattttacatattcaagtggattgtaaaacccctaaagttcatcctgggtccccaaatt 

ctaaggggcttacagccctacttttatggaaagattcactgtttcctcatctttgtccttaatgctctttgaaaagaaaa 

tttattttttccacaagtgtgtaaatacacttgactaaacagtacttgatgacttttattgttttctattttctctcatt 

taattgatttcatggcacattaatgggtcatcacccacatttgaaaagtcttggctgggcacagtggctcacgcctgtaa 

tcccagcactttaggaggccaagacgggtggatcacaaggtcaagatatcaagaccatcctggctaacatagtgaaaccc 

catctctactaaaaatacaaaaaattagccagacgtgatggcgcacccctgtagtcccagctacttgggaggctgaggca 

ggagaatcacttgaaccggggaggcggaggttgcagtgagccgaaattgtgccactgcactctagcctggagacagagtg 

agactccgtctcaaaaaaaaaaaagaaaagaaagaaagaaaagtctttgaccttagcggacatggtgggagctctaagtg 

tctctcttgggtttcattcccagcaaaccacagtgctaagcagtgccggacaccatgtgccttgaggacagcatgtggag 

attgcaccagcggcagctctgagtgcatgtggtgcagcaacatgaagcagtgtgtggactccaatgcctacgtggcctcc 

ttcccttttggccagtgtatggaatggtatacgatgagcacctgcccccgtaagtgaaaaagggagccctaggcacttat 

gcatgccctctgtataggcaacaactcagccatgaggctgtgctgtcagcctctgaacattttagaaacaagactggaca 

tgacctctgctcaaacctgaccagagactgccatcgagaccttgctgcctattgagaaccttcatacagaatcaggcaca 

ttgacagtaaataaatgtaagatagatcacagagtacagaaataacttgtccaacttcagtgtcatattgctcaatccat 

gtaatatctccatatctgaatttcctaatttgtaaagtaaatgctttccaatagataatctctaaggtcccttttgcctt 

caacatcctgggattgagagaggagggaagggtcatctctgttatgtattgggcaaaatactgggctctttacattcatt 

atctcttttaaataatcaagacagaataatatttttgactcaagccagttgaatagtctgttaaaaaaaaagtaaataca 

gtgaattcagatctacctgtgatagtcaattgcaactttttttttttaatagctgaaaattgttcaggctactgtacctg 

tagtcattgcttggagcaaccaggctgtggctggtgtactgatcccagcaatactggcaaagggaaatgcatagagggtt 

cctataaaggaccagtgaagatgccttcgcaagcccctacaggaaatttctatccacagcccctgctcaattccagcatg 

tgtctagaggacagcagatacaactggtctttcattcactgtccaggtaagatgccttgcatatccaaattcaagtgttt 

cactactgatttatgaagaataaaaccttgaaagctacgttgtgtatatgtaactccctgccctcagcccctttccttcc 

tcctaatggttggtacaagaaggaatagaccagaagctggtccaaggcctgacctggacctgctgagagtggtggtgggt 

tcctaagaaaccaattctaagaaattggcctttgattcagacttgaagtgaccactcagcaatgtgtctgtgggtttcta 

gaacagttgggagaggctgggctggtgcaaagactcctcagagattagcagtcaagaacttctctaagagcctgccattg 

acaacagggctgtttgtgaggactttgtaagggaaagtccactgtaaacaaagctaaaagggcagagacagactgggaga 

aaatacctgcactgcatgtaacaactgatgatcatccagaatatgtaaactcccaccctccagggaaaaaataagaagct 

aaatgtgaacccaacagaaaaagtggttattggagataaagaagccatcctcagaaaaggaaatagaacaggaaaatgta 

aactaacaccaggaaccaattttttgtctactaaactggatcaaattttccgtgttcccttttttccatactaagatatg 

ggggacctgcaattctatttttaatctgctaggagttaactttttaacgaaatatttaaatctctgctttttcatgaata 

tcacgaatatatctggtaaaatgacaacccagagaatgggagaaaatatttgcaaagtatatatctaataagaatccaat 

gtccagaacacgtaactcttaaaactcaacaatagaaagacaacccaattaagaaatggataaaggatttaaatagacat 

atgtccagagaagaaatacaaatggccaataagcacatgaaaagatactcaatatcattcatcattaccaagaaatgcaa 

gtcaaagccacaatgagataccactttacacccactgagatggctgtaatcaataaaacaggtaataacaagtattggaa 

ggatatgtagaaattggaactctcatgcaggttggcaggtcctcaaaaagttaaatatagagttatcatacggcccagca 

gttttactcctaggtacatacccaagaaaattgaaaacatatgttacccaaaaacttgtatataaatgcttatgcttata 

gcatcattcttcatagcatcatgctcaaagcaacattattcataataagcaaaagtgaaaacaagccaaatacctgttag 

ctggcaaatggatgaacaaagtgttgtatattcatacagtgtaatgttatttggcaataaaaaggaatgaagtactgaca 

tattttctgacatggattaccctaaaaaaacatgctatgtaaaagaagccaggtgcaaaagattatgcgttgcatgatgc 

catttatatgaaatgtccaaagacagaaagtagatgtttagtggtttcctagggctgggcatggtaatgaagaataatag 

gcatgaggttttctagagtagctgcagcattattttctgacatggattaccctaaaaaaaatgctatgtgaaagaagcca 

ggtacaaaagattatgcattgcatgatgccatttatatgaaatgtccaaaggcagaaagtagatgtttagtggtttccta 

gggctggggatggaaacaaggaataataggcatgaggttttctagagtagctgcagtattattttctgacatgaattacc 

ctaaaaaaacatgctatgtaaaagaagccaggtacaaaagattatgcattgcatgatgccatttatatgaaatgtccaaa 

ggcagaaagtagatgtttagtggtttcctagggctggggatggaaacaaggaataataggcatgagattttgtagagtag 

ctgcaccattttatgctctcaccaacatcatatgaaggttcatatctttccacatccttgccaatacttgttactatctt 

tttaatgaaaactgttctagtggatgagaaatggcatctcactgttgttttcatttgtattttcctgataactaatgaaa 

ttgaacatcaacttcattagttagccttttgtatatcttctttggagaaatatttagtcaaattctttgcccatttttca 

gttatgttgtcttttttattattgagttgtaagagttctttatgaattctgaagtcccttattggatatattatttgcaa 

atactttctcccattatatgggtttcttttcactttcttaatatgccttttgaaatcaaaagttttcagttttgattaag 

ttccatgtatcagtgttttattttatcccatgtgctttttggtattgtatctagaaaatcagtgcccaagtaacccagga 

tcacatagatttgctcctaagttttctttgaaaagtttcatagatttgtgtgttacattaggtccttgatccattttgag 

ttaatttttgtgtatggtgtgaggtaggggtccaaactctctttttgcctgtggatgtggtcctgcaccatttgttgaaa 
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agatttttttttttttttaaccattgaattttcatggcacatttgttgcagatcagttgactgtgaatctaggaactttt 

ttctaagctctcatttttgtgcagttgaccaatgtttctccttttgccaatattacactgttttgattactgtggctttg 

tataagttttaaaattgggaaggctaagtcctccacctttgttcttattttgcaagactgttacagctattctgggtttc 

ttgcatgttcatattaattttaggatgagcttgccattttctgcaacaacaaaaagccaactggtttgataaggtttgca 

ttgaatctactgaccaatttggggtgtattgcctgtttgtttgtttgtttgtttgtttgtttgtttgttgagagagggtc 

tcactctgtcacccaggctggagcgcagtggtgctatcttggctcactgcagcctccgctttcccaggctcaagtgattc 

tccagcctcagcctctcgagtaactggtactacaggcatgagctaccaacacccagctaatttttatatttttttagaga 

cagggtttcgccatgttgcccaggctggtcttgaactcctgagctcaaagcaatcctcctggcctttgcctccctagttc 

tgggattacaggtgtgagccaccacgcctgggccccttgccattttaacaagggttaatcttctagcctgtgaacctgga 

atgtcaatttattaccctctttactttcttttggcattgttttatagttttcagtgtacatcattgtcttgttccttttt 

ttaggggaaaataattcagtctttattaaatatgatatttgtttagttttttttatacatgccctttattgggttgagaa 

agtttccttctattctagttttttgagggttttgtaatgaatgagtgctagattttgtcaaacactttttctgtgtctac 

ttagatgatcacgaagttttgttctttattaatactatgtattacattaattaagaatgttaaaccagttttgcattcct 

gggataaagcccatttagtcatgatgtatacttttttatcatgatgtatatttttaatctgctgttggattctatttgct 

agttggggattttgtgtgtacattcatgagggggatataagtttgtggttttcttcttttgtgattactatctggttttc 

atagcagggtagtactggcctcctagagtaagttggaacgtattcctcctcctctatttactggaagagttcgtcaagga 

ttagcattaattcttctttagatgttgattgaattcaccagtgaaaccaaatgggcctggcctttttctttacaggaaaa 

tttttaattattaattcaatctgtttgttatagatctattcagattttctttttctgagtgagtcagttttggtaatctg 

tttttctaggaatttgtctattctcagtaatcaaatttgttaacatacagttcgtagtattttcttttgttttttttttt 

ttttttttgagacagagtctcactctttcacccagtctggagtgcagtgacgtgatctctgcttactgcaacctccgcct 

ccagggttcaagtgattctcctgcctcagccccccaagtagctgggactacaggcgcacgccaccactcctagctaattt 

tttattttgttttgttttgctttttttagtagagacagggttttaccatgtagaccaggatggtctcgatctcctgacct 

cgtgatctgcctgccttggcctcccaaagtgctgggattacaggcgtgagccaccacgcccggccagttcatagtatttt 

cttgcaggttgctctttagcaatatgtagtctcctgtttaacctgttaacgatttttttttctgtctaatttggttttat 

tattcttttacattttgcaagggctctaaatattgtgagggtttttttttttaagagcttgctctcttatattgtggatg 

caataatttaagccttattgaatgtactaaaattcgtatttttcattctctcatatttcttgcattaacccttccttcag 

gatttgttgctgtgcgtgtttatcctggtggtattcggagattgagagttttcttatactggttaatcctcaaggttaat 

ttgtattaataactaaatgtgtagatctgtcaatattggtcagtggttgggtgtgctgagctgttgtgaaagtgtggtat 

gatgtcctggaaaagctagcaagccagcctcaggacactccagctctggtccacttcgactgttgttgactggtgtgcgc 

atctccctcattggaagaaggaaggagcaatggacctgggtggctgggtgcaattagcatgttcatgggtgtggcaagtg 

cctcccagccttgggtggtagggccacctaaagatagctggcacattgccttgagttttctttcattcttgcttgcttgc 

ttgcttttatttttgagatggagtttcactcttgttgcccaggctggagtgcaatggtgctttcttggctcactgcaacc 

tctgcctcccgggttcaagcggttctcctgcctcagcctcccaagtagctgggattacaggcacctgccaccacctctgg 

ctaattttttgtatttttagtagagacagggtttcactgtgttggccaggctgatctcgaactcctgacctcaggtgatc 

cacctaccttggcctcccaaagtgatgggattacaggcatgagccaccacacccagctgagttttctttcaaacactcaa 

atactaacaggtgcataaacagaacaagtacaaggctatggaaggttgccaagatggtgaaaaactgcatcacactgatc 

aaccaaggcagatctgaagaatcagattttgcttacagtgcaggttgtgcactatgtaaatgtttcccccatcctattcc 

cccctagagttctgcagtgcacagcctgtgcagctgtcctgagcagtcctcagaggtccaagcttcatatacctttctag 

ttgtaaagccctatatccattaggagtcttagtaatcccacttgtggaaatgtttatccgtaaagatggttatttttcta 

ataatgaaaaatgtaaagcaaactaaatattagaatataagggaatggttaaataactgttaagattcatcaaaatcatt 

ctaccatttgaagttttgtttataattgatgaaataaggtggtaaaatagttgtgatctaagttaagaaaggtgaatatt 

aatatgaacataattgtgcaaattatgttcttctggtatagaaaaataccagaagaaaatatatcaaatgtgaacagtgg 

tctctgagtggttggattgtatatgatctttccccatccttctattttttaagtatactgtcatgagcatacgttatttt 

ttagtgaataacaacaaaaaatattttttaatatcactatgggctttcaccttgctgtgatacattttcaattttcctgg 

actagctcttctttgatattttttatcttatttctcaaattagtttttagtgggccagttttggtgatttatcaaagata 

aattaactacagagatagttgcagataaatcttattgaacttaactacatgcagttcttccatggtacagaccccttgaa 

actctttttcttgcagcttgccaatgcaacggccacagtaaatgcatcaatcagagcatctgtgagaagtgtgagaacct 

gaccacaggcaagcactgcgagacctgcatatctggcttctacggtgatcccaccaatggagggaaatgtcagcgtaagt 

caaattggtcaggtttactcatggcaaatcggtgtggaacacagcacttctatttgacttgaatatctaggaggaaaaaa 

gccactttgttttggataccacatttcttattaaatcatactggtcagaagctcagctgagctggaagaaagaaccaagg 

tttcagtgtagctggttaaagcaacaagccaaaaatgttagagtcaagcctttgaccacattctgcagtggtataagcaa 

gggcccaaaaccagagaaagtgccaacttcccctgtttcctttcctccc'attgaacagggcacaggtcaatagtcagagg 

gatcagctcagacatgggagtcttggggctcatctcactgaggcccggacaaaaggttccagcagtatatggacactaag 
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gttgtgggaagggcaaggtgtaagggctaggactgggaaagtcctgggttggagtcagagttaggaaagtgtcttctgaa 

ttccccctcctcccccca-tcaggaggccttggtgcctgaagaagacctcagcccaaggcctcagatagggagcctgggaa 

taaaggctccagggctaggaatgtgaacatgtgaaggagcgtgacccgtgaggtagacctgagtccttaactgcacctct 

ctcagagccccacacttaggcccacctcacaggtcatgctctgccaggtaaagcctgtggcattgcctgtggtcaggccc 

tgaaaaatcacacacgggctttttaaccaggagccagactcctcaatacttatagtgtattttaaaggtttaaatagtcc 

cgtgttggctgggtgtagtggctcacgcctgtaatcccagcactttgggagggccaaggcaggtggatcacttgaggtta 

aggagtttgagaccagcctgaccaacatggtgaaaccccatctctactaaaaatagaaaaattagctgggcatggtggcg 

ggggcctgtaatccaagccactcaggaggctgaggcatgagaattgcttgaacctgggaggcggaggttgcagtgagcta 

agatcacgccactgcactccagcctgggcaacagagtgagactcagtctcaaaaaaaaaaaaaaagaaaaaaaaaaaagt 

cctgtgtttccccatttatgtcaaatagaagcctgcaagcaataggacattttattatgagaacaaaaattatgacaagt 

gatattaatgaactgctttttttattctagtttccagccaaaaaacataatgagtactatagtcgaaagacttttcaaag 

ttctgagcaggaaaagtaaacaacatagggaaatctctacctactccccagatcccaactccaaactctttggcatggcc 

cttcagaatctgatctgatccaagaccagccaggagagaggcagcgttctagctgtgtgtcccttactctccctggggtt 

cagtttcctcctaataatagggtggttgttaagatgaaagcagagcagcacatgacacatgttccacaggatactagtca 

atacttagtagatgtgggcaattcttattacccttctaatgccattctccactactcctaagtaaaagttcttgctcaga 

ttaaaaggtatttggtgaaaatgttttccccactctttgtgaatagacttaatccgttagacagtaaccagagtacctaa 

cagagtggtggagtgtacaaggcattgtgctgttacccactcaagacacaggtgcttttattatccccaagatcaagtaa 

actgcccagggtctcacagtgagatgtggccaagctaagaaccacgcccagtctgtgctgaccttaattacttggctggt 

tgtgcttccagatgccatcatacccacatcagcagctgctatctagaagcatcacattttcctctgtgagatctacaggg 

cctgagtacaatttgccttatttttcagagtcctaatccacggtagaaagcgtggtgtttatagaatactgcagatggca 

ccaagtctttgatgttttcttcttaaaattgtatagtccttaggtaacgatagtagccatgatttcttgaatgcttactg 

tgtttgagacatcatatacatatcatcacaaacacccctatgagataggtactgttgttatccccatcatatggacaagg 

aagctgaaactcagaaaggttaagtagcattcccagagtgacagaagagccaagtagaggacgctgaattcaaacccagg 

cagatggacgtaaggcctttgcttttcactatacattacacaccttcccccagtctcaacagagccaagtgtcagacact 

catgattctgacttcagcatccattctgttaatctctgcactattctgttttatagttttctttgggtgattccaaagga 

tttatttttaaagcatacctctctccagctgaatgcctttcatttattcaccagcaaaacagggtataaagtgaaaaggt 

gttgaccaaaaaggctttcacttttttcaactggagtataatatttatcacggcttgtattacgttggatgataaaagga 

gagatgttgagttggccagatgagggagatgggaagagctattttctgtaaaggcaaaggaaaaggtgaagcaagggtgg 

catgtcctgagggcccctaaagcgctgaccttggcattggatcctggttctgagccctgcggtgattccaaatgtgacca 

tcgtggagaaggcacctcagccaagaccagccctccttcagaggacagtcacctccagagcacctccctaggggcagcag 

ctatgtctgtgtccccacacccagcacctggcattctagagatgcttcacaaatatttattgaatgaatcaaagaatcat 

agcagtgaaaaagagagtctattgaaaagatcaaaatagtcattgcttcagaaggcagtgggtaccagcattcactttcc 

tcctgtcctttgtttggttcctttaatgttacctatacattattatcattagctgaaaaatcatgaatcttacccattga 

atgctcgtactttaatctgtctttcctactgaactcaattcagataaaattgcttgttttggaaaaagtctccaatagtc 

agaattttttaagccagttgtgacctctgttcctttttcttctctgcagcatgcaagtgcaatgggcacgcgtctctgtg 

caacaccaacacgggcaagtgcttctgcaccaccaagggcgtcaagggggacgagtgccagctgtgagtaccatactccc 

tggaccaccagggaggaccaagaggctgtgcagctgcctgaaccccaccctgagagccacccacttccctgtgtcttgtt 

gctgtgggctctgagggatccctgggttgattagtttgaaattttgcccattctatttcagacaggtcagtccccaaaat 

gaggaggtcgtcgagttagcagcaattccttaatggctcttgaattcacatttttgtttaaatgatactgacatttcctg 

ggttgtccatttggagtagtcattttaacttcagcaactacttgatttttgtcatgtcaagagattatactctttcccaa 

agagtagagatggaagagctggtgttttggtgatggcatttatttggcgtttggttgtctgattgtggaaatgatcctct 

tacctttaacatttcccattactctagcattttccttgttgaagcatgtcaggcatcttcctggagagggtttgaagtcc 

ccatccatgaggatgcaggtgcagcagcatcaggcatgtttaaggtttcagacctggccctgcctccactgagcaaggtg 

ctcttgaggcaaatcactgttccccccatctgtcagtagtgggactagcagtaattgtttttggtgtcctggtaggttga 

aacaggaaggatagttcctcaatcaatgtataccgggctgtctcactgggaaacctcataaaatgcatgtatccgcattt 

actcattctcaagaaaaaacttaaaaagtgtttagtgtcaaaacactgagctgattaatcattgtaaacatcattatttc 

ttaaataacagtagtaataatgccaccagtgattgagacatcattgctggtgctactc^gcagggtttggtctgtctact 

ttttagtttatatctgcatatatgtttactagcttcatgttgccctaaagccaatatgtgaagaaaagtctaatccaagg 

ttaaatttaacttttaggtccacattcccttaaacagaaaaatgtcatcgagcaaaattaatcactcttaccctcagtta 

gctgatgcaacaaagacatgaggtggcatcaagtaagactttcaagttccttcgagatcgcagatgtaggcaggtgctgc 

tcatcaccctggctaaagggacacagcatacctgcctgggaaaggccattacccacttcccgccctttcctcttcatgca 

tagctgttgtgtttacctaaatgcttcatgaagtgggaggctgggttttgctgatgtttatagatgatctgtatggggaa 

attgttttctgagtagaacatattttttttctacagactggtgaacacttgttccagggtttaaaaaaacagtgattctt 
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ggtatttagtcttctctcacttgtgtttaaagaagagaattagtttatccagggaacataggaaagaagtgagaaattag 

?atttgaaaaatgttttgactgcacttttagaagaatatgtagtccacacaaaattgaaaatgtttttcattatagtaaa 

taggtttattacatttggatttctctcattgctttggcaaatgcttgtcaattgtcctcatatagttggtgtgtgaactg 

ctqagcagtcagtattgaagcgttcatgcatgctcttcctagactgcctgagctgagttatggtgaaggatgcaattata 

atggctcSagactatctgtactttatataaaaggatctgtttggttttaaaatgaattcagtttctgttttaaatagcag 

tataaaatagtcttttttcccccccagatgtgaggtagaaaatcgataccaaggaaaccctctcagaggaacatgttatt 

gtaagtggttttgcaattcttatttctagaagcaaagtagttcagtaaaacttcattgtttaaacgggtttgagaatagt 

Lgtgctataagactatagcagccaccaatgaagtgttcccagacttgatatgtttacattctgttaagtttactacata 

taggagcactgttttaagctgttttaattgtgtttggggttaacgttaatgtgtccatagcagatagcaggagagagtag 

aqaggcatgcatctttgtctatccacatttatgttctcttaaaactttactttatttgccattacctagttggggtcatc 

a?atttcgtgttttaggatgtagatcaaaaacagaattcttacaatatggttgaactttttgtcattctgtctggaactg 

atttgtgttaaccaccttgaggtgaaaggtgagtctgacaaggtgaccatgtttttatgggtaaatgtgttttctcttta 

tgggagaatccacatggtagaccagagtacgggaccagaaaaaaggagttaatgttatggcatatccattgtaattatat 

acctgctgtactgggttgaataacatcccttcaaaattcatgtccacttggaacctgtgaatatgacttcatatatatat 

atataaaacttctaaaagaaaacaagagaaaaacatcttgaccttgggtggggaaaagttgttagataggattctaaaag 

cataattcataaataaaaagtagactaatgaaactatataaaaatttaaaacttgtaggaaaaaaccatccaaaacataa 

aaagccacagactagagaaaatactcgcacatcatcatatatttgataaggagtttatatccagaatacatgaggaactc 

ttacaactgaataagatgacaattccagttaaaaacaggcaaagatttgaaaagacatttcatgaaagaagatattacag 

tggccaataagcacattaaaaacgacttaacatcattcgttatttggaatggagattttaaaaccacagtcagatacagc 

cctaaagtgattagaatccaacgctaatgccatggctttttagaagacagtggtaatctcatgtctgcttctgcattcag 

tctgttqcagtacatctttttggttaaacacatgaaaaaaacctggccttaccaggcatgcagttggaaaagggtatagt 

gatacccttttaatagccttttcagataattatggacgttacttgatattatgctgaaactggacaactggtcatttctt 

taaagcggtttgatgggttaacccaaataaaatgatactgcccacagacttgattttctttgaggaacctgggtacagct 

gagttaaagtgctttcctcgttacttaatttataagaaaagcagcctgtatctctaaagacctgttctattttgtgtgtg 

tttagtttttaaatatgcatttctttctttccatagatactcttcttattgactatcagttcacctttagtctatcccag 

gaagatgatcgctattacacagctatcaattttgtggctactcctgacgaagtaagattttttaaagtcttcctattttg 

ttttgaatttgtatggatctttttcttggtcattacggatggacgtactgccttaacagtgctctccagactggagtaca 

cgagatgatctctagaggtataggaaagaaatgttagactctacgttatctcctttccacataaaaggcaaaagtgatgt 

taataagatttaccaggatcttagacacagactgacattgattccacgcatacttactctgcctgtcagcccatcatggc 

ctcatacagaaaggggactccaccatcagagggcagatagcagagcctgggtttactctctcaagagtgaccagaggctt 

aaagacactgctgattgaaatgccagtgatgcagccccaatcagacagcaagggaggggaccccaatcagacagcaaggg 

aggggaccccaatcagacagcaagggaggggacactctgctcctagagtgagttcttagcctcattaggaggcaaaacag 

caaaggcttagttgggtccattaagaagttagccaggaataatttaaattgttaaaatatgtatgtaaaatgtggatttt 

tttatctgctgtcattaacgatgaagcccaacctgccttaaggtattacctagtggtagaaggaaggccacactgcggga 

catttaaaactgaaacatacagaacacgaagatgcacctgtacagtttcttcaatgaaatataaagtcatgcagtaccca 

cttcagtatttaaagaaattttggtaaacataatggtaaattatttaggaacttccttggagatttcttacttctcatga 

acatacacaaagccatttaccattacaaaattccattaacaataaatgtgacaaataatacatggaaaacaatatggcag 

taagacaaagtatattgctttgttaacaaatgttatctataaacattgctaaatttaattttaaaagtagaacaaagtct 

atagtgtggtatgtttactatacaagtaaatgaaaataggatttgtttttaatattctatttcaaagataaaattaagaa 

aaaatgtactagaaaagatacattctcaaaagtaaattgctttttaagtaaaaataataaattactttaaatgaattatt 

ttatggaaaaactataggtattaatatatcatttgagtggctgtattgactggaatttttttcttaacgattttagaata 

agattttaacaaaagtaccatatatgaaatgtattcactgcctcataagcaagcgtttgaagtgctcaaaatctttcagg 

atatacttcatgccattaatgtcattaaaaaaataaatatagtagaatctttgtaatacttcttaaccaggtgggaagca 

accatgaaagtatttggacctttctggatttcccagtttatcttacgacagaatacttactagagttatccaaatgcata 

tgttctgtcctctataaaagcacaagcattttaagtttattgattctttctgtggaaagacatatagttgacccttgagc 

aatacaggttaaaactgtgaggatccacttacacgtagacttttttcaaccaagcgtggaatgaaaatacagtatttgca 

agatgtgaaacctgagtatacagagggtggacttttcatatgcaagggttctatgggcagactgcggactggagtatgtg 

tggatttgggcatgctcagggggtcctggaaccgatgacccacgtataccaaaggatgactgtaattgttagttgtgtgc 

tgccagcaaatactaaatacaaataagtaaacacttgagctgtcctttcaagatgaaggtgaggtcttatcagtaaatga 

gaaggtaaatgctttgtgagagaaacttctggtttaatgatcacattttaaaaatagctgtttggaaatgtttccattgt 

tgtgattttgtgctattaaaatgatcaaaacaacacccttaaaaatcttattctaacctctcaagatcttttaaaaatga 

ataatttcagtacagtcggatgcatctgtaaaagataaaaatataacattgattagtttgcaaaaataattgtttgaccc 

cagttaagagatgtactagtcaaatttcagtttgactaattattaatgttataatttacctaacatcaccaacagtacac 



FIG. 7 



WO 01/78894 



PCT/US01/12245 



14/157 

ttcctccactggcttaacagattcctcagcaatatctttattagtcattaagaccaaggatcaaaataaattaagttaga 

ttagccctgtgaactgctatatctctaagtggtagacaaggttttcaaaactaagaagccatactcaatcatatttctct 

tgaaataatatttttagtaagagaaaaatatttttcaaatcatgaatataataaaattattttttaaattaagtacattt 

cagctctatatgccttttttaaaggctgtttccagttttggagaagttttacactgtatataatctatgagtttagaatt 

atatgggtttcagttataaataaatataattttgggcacttgatcaagtgttttattggtaggagtgaacttcagacatt 

tgaaaacagctgcccagagaatccttaggagggtgattccccagcacaggtcaggagatgcaaatcactgtgctctgata 

ggggtctttttaaaaggcactttctcatgcagttaggcatttgataaagcattaaattatgtatactttaatgggaggga 

ataaaattttattggcatatattgcttgtattaaggaaaattggttagaaattttgctaactcttctgtgagtttctcta 

aagacatcataaaatgtttgtgattaagaacatttagaggagtaaactttattgctttattttaaaatctagaaattgtt 

ttaattaattttctaataatttgtaccgctcatcagcaaaacagggatttggacatgttcatcaatgcctccaagaattt 

caacctcaacatcacctgggctgccagtttctcaggtaaagacatacctagagaagaccctgcaaatgaaggtgtggtag 

attaagaaatgtaatataggaattgagaaagcgagctcaggagacagattggtttgaaacccacccttgccacttactag 

ctatgagaccttgagcaagtatctaaatccctctctaaaccttagcattattttattcatctgtaaagtgaggataatga 

tacttacctcttagaattgttgtatagattaaattaggttatacctaccagagcttgctgtggtgtcaggctcagtgtgt 

ggttactaccctagcccaccaccaccatttctgttcttgctgtggccactggcactaccatcatitgtctacatccgtgct 

tcggaagtgaaaaattcaaatgattcgtttcaataaatgaaaacattttaaataaaatgagattttagtaggtacagaga 

aatgtaacttgggaattacatcaagctctaaaagcacagctcttgctgtctgccttactgtgattcactgaagatctact 

gtatagaaaatctaaagaaataaaggatgaaggccaagtgctgtggctcatgcctttaatctcaacactttgggagactg 

aggcaggaggattgcttgagctcaggagtttaagaccagcctgggcaacatagtgagaccctgtctacaaaaaaaaaaaa 

aaaaaaacccaggtgtagtgactcatacctgtaatgccaactactcaggaggctgagatgagaggatcacttgagcccag 

gggttggagattgcactgagccatgatcgcaccattgtactccagtgtgggcaaacagagcaagatcccatctctttaaa 

aaaaaaaaaaaaaaaagaaaaaaggataatcactacttaacttgataactcaacaagtagatatgggtttgaaatttgtc 

cat taaatt tact tgcaccgtgctgttaggcaagttacttaaggtttctgagccagtttcctcctgtataaagtaggata 

gtaaaaacaccgtcctggcagggcgtgatagctcatgcctgtaatcccagcactgtgggaagccaaggtgggaagatcac 

ctgaggtcaagagttttgagaccagcctggccaacatggtgaaaccctgtctctactaaaaatacaaaaatcagccaggc 

gtggaggcacgtgcctgtaatcccagctactcaggaggctgaggtaggagaatcgcttgaaccttgaaggtagaggttgc 

tgtgagccgagatcacgccactgcactccagcttgggtgacagagtgaaactccatctcaaaaaaaaaaaataaaaaaac 

accttcccaagtagagtgatgtgagaattaaatgagataataaatgaagtactcaatatagtgcttgaaatgtggtaaat 

ggtaactatattttatcattattactattacaatactgggtttttaaaaatcaaaaacacaaagcaatgagattgatgca 

aaataagaatattgccttgtgcacgccacttacgtttatcatcttaaaacattgtgtagaatttgagaaaagttcagaaa 

ctctcaatgaggagggacttttaagaaaaagtctgaattatcagagtatttggagaaaggcaacatctccaggcatgtga 

aagatttgcaatgagccgggcggtggctcatgcctgtaatcctagcactttgggaggctggggcaggtggattacctgag 

gtcaggagttcaagaccagcctgaccaacatggtgaaatcccgtccctactaaaaatacaaaactagctgggtgtggtgg 

ggcgtgcctgtaatcccagctacttgggaggctgaggcaggagaattgcttgaacctgggaggcggaggttgcagtgacc 

caagattgcatcattgcactccagcctgggcaacaaaagcgaaattccatctcaaaaaaaaaaaaaaaaaaaaaaagatt 

tgccatgagtgtctcaatgaagacgtgataatgtgggctctagtcacagggtctaactcagacatggaaaaaagtccatt 

tcattaatctttatcggcacttgaattcctggctaagggagaatgtggaacattgaaggactctctgggaataggatgga 

gttataccagattagggggacttaaatactgtggtagctggtggtagaagggaggactgagtgaccccttgaacccctcc 

tccctgctacagtgggttaggcagtgagcggtacatcagcattactggcatgggagtctggcgcattgccaaggaggtgt 

aaaggggaaatgcaaaggaattgaagtggtgtgggcaaagtgaatgccagtgcttgttaataggattctagtggtatctg 

tattttcatgatcatgtgtgtcacctgtttgggggtggggcaagggtggaagggagttacatggattcctggtaaaacca 

tttttctttctttctttttttttttttgagacggaatttcgctcttattgcctaggctggagtgcaatggcgcaatcttg 

gctcactgcaacctctgcctcccaggttcaagcgattctcctgcctcagcctgctgagtagctgggattacaggcatgcg 

ccaccacgcctggctaattttgtatttttagtagagacggggtttctccatgtttgtcaggctggtctcaaactccgacc 

tcaggtgatccacccgcctcggtctcccaaagtgctgggattacagatgtgagccaccgcacccggccagtaaaaccatt 

ttggttaggggcataggcttgtatatcagcctgcccagctttaaatcttatctccatttcttgttggctgtgtgacttga 

gggaagttacttaatttctgtgaacctcaatttcccagtctataaatgaagataataatagggcagctgtgaggattaaa 

tgagattgagctcttaaaggctattactggaacacaggaaatgtttgataaatgctattgtccttattattaatgaggcc 

agattctgttctccccctagccccccaaaaaatgtctcttctctttcatgtttttcttttaacagctggaacccaggctg 

gagaagagatgcctgttgtttcaaaaaccaacattaaggagtacaaagatagtttctctaatgagaagtttgattttcgc 

aaccacccaaatatcactttctttgtttatgtcagtaatttcacctggcccatcaaaattcaggtaagaagaggcttttg 

gtctcatacctgcaaaggtggtgaaatctctttagtaagactaaatttactaatttggagcttgtggtaaatgagatgtg 

caatgtggctttgcctttgtaacgtgtatggcagaggagtgctgagcacatgcatgctgcacagaagattggagtgggga 
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tggactgtatcactcatgaaagacatttgcaaaagcactgttg'aaagcaagttggcatgtaacagatttggtcattataa 

acgtattcacttcttcagtgagcatttgccatgtgaaagcctgtagggctacacaaagaacttcaattctagagtaaggt 

ggatgtaagtgaaacaaacccacatataacaactgaaagccagagtgtggaaagtgacatgagatgatcccagaaatgct 

atcaaagtttaaaggaggacaaatggggagactatgttgaagaacatcagccttctagtcagacagaggtggattgattc 

ctggctcctatacaaatcatacagcctttccaagtctccagttctctgtgcatgtgacctgaagggtagttgtaaggggc 

tgtgcagctgctgcagtgtcttgttagcctgctcctttcctctgttcccagggggccagtgtactccctcttgtccgaga 

cccatggccccattttaactttttatactcatgtcccctggggcctttcctcaataccttctgcttcttaccttcttcat 

ttaggtgaatgtggaggttagggataggtgggctttcaaggactggttcacctttaaccatggaagcatggtcactggac 

ggaggctgttgctgtttgccaatgttcagaagcataatcaacctcagaagcaagtcaccacaaacatatgaaaaaagttc 

aacatcactgatcattagagaaatgcaaatcagaaccacagtgagataccatctcacaccagtcagaatgtctgttatta 

aaaagtcaaaaaagaacagatgctggcaaggctgtggagaaacaggaacgcttttacactgttggtgggagtgtaaatta 

gttcaaccattgtggaagacagtctggcaattcctcaaagacctagaggcagaaataccatttgacccagccatcccatt 

actgggtgtatatccaaaagaatataaatcattctgtaacaaagatacatgcacatgtatgttcattgcagcactattca 

cagtagcaaagacatagaatcaacctaaatgcccatcagtgatagactggataaagaaaatgtggtacatatacaccatg 

gaatactatgcagccataaaaaggaatgagatcatgtcctttgcagggacatggatggagctggaggccattatcctcag 

caaactaatgcaggaacggaaaaccaggtaccacatgttctcactcgtaagtgggagccgagaacacatggacacatggt 

ggggaacaacacacactggggcctattggagagtgatggggaggaaggagagcatcaggaagaatagctagtggatgctg 

ggcttaatacctaggtgatgagatgatgtgtgtagcaaaccactgtggcacacgtttacctatgtaacaaacctatacat 

cctgcacatgtacccctgaacttaaaaattaacaataacaaaaaaagcaagttatcactcatcaattaggatgccttggg 

actgtgactaaagaacagttgatctttccattctggaaatatggaggacaaaaactatgttgtagtttttctcaccaccc 

tctctcctttttcctgtctccatggtctaagatttgttaatcctctcatcaggttctccctttgccctcgcatactccct 

gtgtccctcctcagccagtcttgtaagcatcagccacgcttcttattcctgttttctcttgtctagtcacacatctgctc 

acaatggtctggcctcttccctcaccacatcctgaaactgtcctgtcatagtgggccccagtgatggaatttttcagtcc 

cctttttatctcattgcatggctttgaatcatcatctttcttgtcttcctggatttttcttttcctcgttttctggccag 

tctttttagggaatcctcttcctcttccttaaacactgggcttgtctaggattcagacctggtccttttctctcttcact 

ctgtttcctgcatgtgggacttgtggtgccacatctattctggtgattctcagtgctgtctccagaccggcactggcagt 

tgctgcctggacagccctacctgggtagtcaaagctccccgcgttcggtgcatattccttcttgtgatcttagccccaga 

cttccatgcttcctttctctgtaaacagcaccaccatcaccctgttgtacaagctagggcctgatgatcatttgcattcc 

tgggataaacccaacttgagcatgacgtattatttccttactttacacgttgttggattcagtttgctaatatttaggtt 

agaattttgtatctatttcatgagtaagaatgtctgataattttcccttcctgtccttgttatggttttgatatcatgtt 

aaactaacttttaataattataatttgaggagttttcactctttctcatctctggagatgtaagattagagttaactgaa 

cctcaagtacttggtagctctctcctgtaaaatcatttgatcctagtgtcttatttgtggagatactttttagctgctga 

ttcagcttctttaatagttataggatattttgaatttcctctttatttgattcactttttaatattttttctaggttttt 

atatcatgtttttaaatgtatttttaattttactttttaaattgatacataattgtacgtatttatgaggtacatgagat 

attttgatacatgcatacaatgtataatgatcaaataagagtaattaggatatccatcacctcaaacatttatcatttct 

ttgtgttgagtacatttcacatcttctagctattttgaaataatgaataagttattgctaactatagtcataatgactat 

tgtgccattgaacactagaacttattccttgtaactctatttttatacccattaacctctatctccccagccccccagca 

gctggtaaccaccattctgcactctacctccatgagatcagtttttttagctcccacatctgagtgagaaaacatatcta 

tctttctgtgcctggcttgtttcacttaacataatgacctccagttccattcatgttgctgcaaaggacaggattccatt 

ctttttgtgactgaataatatttcattgtataatatatacaacattttatttgtccattcatttgttgatggacacgtag 

gttgattccatgtcttcactcttgtgaatagtgctgtgataaacatttagagcatgcagtatctctccagtatactgatt 

ttctttcttttggatatatacccagcagcgggattgctggatcatgtggtggatctattatgagcagtcttcatactgtc 

ttccatagtggctgtactaaataatttacattcccaccagcagtgaactagtatcgtctttctctgtatcctcgccagca 

tctgttattttgtctttttaataatagccattttaactgggatgagatgatttctcattatggttttgatttgcatttgc 

ctgatggttactgatgttgagatttttttcatatgcctgttggccatttgtatgtcttctttggacaaatctctattcag 

atcatttgcccatttttaaatcaagtttttttcctattgagttgtttgaattgctggtatattctggttataaatccctt 

gttggatggatagtttgaacatattttctctcattctataagctgtctcttcactctcttgtttcctttgctttgtagag 

ctttttggcttgatataatcccatttgtcgatttttgcttttgttgcctgtgatttcgacatcttacacaaaacatcttt 

gcccagaccaatgtcctgaaggattttcccaatattttcttctagtagttttatggttttaggtcttatacttaagcctt 

taatccatttaaatttgatttttgtatgtggtgagagagaggggcctagtgttatttttctgcacatggatatccagttt 

tcccagcaccatttattgaagaacctgtcctttcccccactgaatattcttgactccattattgaaaaccagttggccgt 

gaatatgtggatttatttctgagttctttattttgttccactggtctgtgtatctgtttttatgctggtatcatgctgtt 

gtgggtactatagctttgtagcatattttgaagtaaggtgatatgatgcctccagttttgttctttttgctcagaaatgc 
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ttt agttggctgagtacagcagctcgtgcctataatcccagcactttgggaggccgaggctggtggatcacctgaggtca 
ggag?tc!!ga^ 

^SctgLatcccagctacttgggaggctggggcaggagaatctcttgaacccgggaggaggaggctgcagtgagccaa 
gattacLcactacattctaccctgggcaaacaaagcgagac^^ 

tQCtttggctatttgaggtcttctgtggttccatacaaattttaggattgttttttctatttctgtgaagaatgtcattg 

tgatttcttct^tgctagttcgttgctcctatatagaaatgattctgatttttgtgtgttgattttgtatcctgcaact 

ttactgagtttgtcagttcttagtttggtggagcttttagggttttctgtatataagattatatcaactgcaaataggga 

cactttgLttLtcctttcagtttggatgccctttatttctttcttttgcctagttgctctggtcaaatatgttgataa 

cttgttlatgctgtcctctaatgctcgcagcgtctgtgttcatgaaaccctttgttgtattccaatgttgatagca^ 

tcacagtaatcaaaatgtggaaacaacccaaatatctatcagtggatgaatggataaacaaaatgtggtatgtatgtgtg 

tgtgtgtgtgtgtgt^gW^ 

tLatgcgacaacScaagtgaaccttgaaaacatgctaagtgaaataagctagttgcaagaggacaaatattgaatgatt 
ccacttaacatgaaatatctagagtagtacacagattcttagagacagaaagtggattggaggttaccagcagctagggg 
gagcgggaaatagggaattactgcttcatggttattaaagagtttccctttggggtgatgaaaaagctttggaagtaggt 

Ig?gSIttggt?iLcaaaattgtgaa 

Statatcttlctacaataaaaaaagtctttaaaaatctcagcaagatgtttttgtagatatagacaagcttattctgaa 
atttatatgaaagggcaaaagttctagaatagctataacaatttagaaaaagaagaataaaatgggaagaatcaacctac 
ccaatgttLggcttactatttaggtacagtaatcaagacatt^^ 

taggctgatctataggtagatcccagaaatagaaccacataaatatgtccaactgattaggatttgattttgtttgtggt 
gtttttttgagacagggtctcactatgatgtccaggctggtcttgactcctgggctcaagtaaccctctcacctcagcct 

cctgaaS 

agggtctcactttgtcgcccaggccaaagtgtagtggtgcagtcacagcacactgcagcctcaacctccctggc^ 

ga^cctcctgcctcagcctcccaagttgctgggactacaggtgcataccaccatgcctggctaattcttttttttttttt 

?ttttttttttttttggtagagatgagatctccctgtgttgcctagtctggtctcaaactctcaaactcctgggctcaag 

tgatcctcccgccttggtctcccaaagtgctaagattacagacatgagccgttgcgcccagcccccagctgatttttgac 

aaaggccctgcaaaggcaattcaatgaagaaaggatagccttttaacgaatagtgctagagcagttggacaaccataggc 

aactaaaggaatctctaaatctcacatcttatataagaattaacgcaatattgatcacagatctaaatgtaaaacataaa 

actataaaacttttagaaaaaaatataggagaaaatctttgaatcccaaaacgaaacaa^ 

aaaaacgtgattcataaaaggaaaacttgataaactggaccacataaaaccaaaagtttttgctctgtgagaaaccctgt 

tatgaalaJgaaaaggcaagctatggactgggagaaaatatttgcaaaccatatttctgggaaagcacttgaatctagaa 

tataataactctcaaaactcaacagtaaacaaacgaacaattcagttagaaaatgggcaaaagacatgaatagacattta 

accaaagaggatatacatatggcaaataatgacataaaagatgatcggcatcaataaccattagagaggccaggtgtggt 

tcacacctgaaatcccagcactttgggaggccaaggcaggcagatcaccacttgaggtcaggagttcaagaccagcctag 

ccaacatggcaaaaccccgtctctactaaaaatacaaaaattagccaggcatggtggcacatgcctatagtcccagctac 

tcaggatactgaggcaggagaatcgcttgaacctgggaggcagaggttacagtgagctgagattgtgctactgcactcca 

gcctgggcaacggaatgagactgcctctcaaaataaaaataaaataaccattagagaaatgcaaattgaaagcacaatga 

lataacactacacacttatcaaaatgactaaaattaaaaatagtaataacaccaaatgttggagagagtggggaaagatt 

ggatcactcatacattgttggtgggaatataagatggtacagtcactctggaaaatactttgtttcttaaaaaactaatg 

ttatacttatcatataacccagcaattgcattcctaggcatttatcctagagaaatgaaaaccagtgttctcacaaatct 

ctgtacacccgtgtttatagcacttgaaagcaatatttaacagctttatttataatagcccaaaactggaaacaagccaa 

atattcctcaacaaatgaatgcttaaataaatgatgctacatacacaccatggcctaatacccagcaacaaaaagaaatg 

aactgttgatatacagaaacacaacaactttgataaatcccaagcaaattttgctgattgaaaaagaaaatcttagaaat 

ttatatactgcatgattgcatttatataacatttgaaaatgaaaattttagatcttatatacctatgcacatattgaata 

agtatcctgaagcttcaaatttctctgacttcttaggactctgatctctagaccctgttttaaaggagcttatctgatta 

qgtcaggcctacccacacaggagtagggaagatacagagcggtgcccatcaagaaggtggaaatcttgggggcattctta 

laattc^cctgctgtaggctgcatttggaatagagcttgcaggggtttcggtagtttctccgatccgaaaaaggttgag 

ctctaggtcccagtcagtataatttcagtttggcaacatctcctgacactattaagatatagattatatatatttgttag 

gttattttcatgtaaacacttgattagggttctagtaaataaaatttatttatttactcattttaagttgagattgtttt 

tgtactgacatacctctccatattctcaatctaattgaacatcgtgctttgttgcttatagctgacaaatgattaatact 

ttttagattcttagtgaaaatcttgacttatgctgctgacacttttcccagtctgtgcctataagcaggcttttgtgctg 
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gagatggccatacttgttgccaatgccgagtttttttcctcccagttccctattgctccaccctctgaaagtcactgaag 

tggttttagagctttcattctagagggagctcttccccacacttacttctaagaagtaataattttaagtttcataatca 

aggagatgttgattttcttcctggttttctggtgctccttatcttttctggatcactaacttaaattaagatccacttat 

gaatgctgcattcaggagcacttgtgaaaataaacgatggcatgcccaccatttagtttgccaagctttccttgaaaacg 

caggtaagttctattgcaaacgtgtctttttgctgtttggtacaaatgactgcactattcactgtgactaggcccgtgtc 

accgtgtgttctcagtcctcgcatgagagcctggttgacactggttctctctgctttggggaccgtatctggctgtgcgc 

tatgtggagggctgagctcgtgtcttcctgggtaaatacatcatgttctcaacaagggctgcatgtttctttattgctgt 

tttttccccagagacttagcttttctaggtgtgtagcatggctcttaaatacacacatttagcaactgtgcaaattcagt 

aaaaatggagagacttatttaaggaagacagtagggaaatgtctaaatagtgcagtgtaggccattcagttaaatttcag 

tatggctttcgatgcaggcaaagccgttttgcctttgattctcattttttaaattagattttaaaaattgagattctcct 

tttttaaatgagattttaaaaattgagattctccttttttaaatgagattttaaaaattgagattctccttttttaaatg 

agattttaaaaattgagattctccttttttaaatgagattttaaaaattgagattctccttttttaaatgagattttaaa 

aattgagattctcatttttttaaagcaaagtaactgaggaaatctaattctctacttggaaatttgtgtaagcattgggg 

ctttaaaaatggtaatactaaactcaagttccttcactctgaaccaatcccaggattcttttccataattaggaacagat 

caaaacctataactgctgagggaatttgaaactgactataaaaatctttgtagatgcagctaaaaaacaaacaaaaccca 

cagactgaaaagtcctgtttctgggagggtggcgggtgtgttggggccggttgtgcctgttttttgattggttgttcctg 

tgggtgagcacccttccctctgcccctgctttctgaaagctatcctagttgttctccccagtaagccggggctcagccgg 

cttcagctatgtccatagtttgggggcgtgggcggtaggtattcaagatagaagatggttggacagaaagtgtgtgaggt 

gggaaggtggagtggtgaggctctggctcgttcccgactctgcaacagaggttgttactgtaatgtggagctaatataga 

agtctgcgtgctgggtgtttcatctactgttttatgtaagaggccagtgttcacaagtaaaggcaacacagaattggtat 

tgagaactgggcattcatttagattttagagttctgtttatgggtgctaacaggatcgctatactgagcaagggatgtgt 

gaaacttttgttcacttggatgtgctttttttcttataacagctttattgagacatcatttacataccatacagttcgct 

catttaaagtacacaattcagtggttttcagtgtattcacagaagcgtgcagccacccgtcactacagccaattttagaa 

tattttcaccacttccagaagaagcccctagcagtcagtcctcattcccccctcttccggcaacccccagtctgtatttc 

ctgtctctgtggatttgcctattctggacattttatataaattgagtcacatgatatgctctcacttagcatactgtttt 

caaggttcacccgtgtttagtatgtttcagtacttattccttgtcatagctgaataatcttccttatatgtatataccac 

gttttgtttatccatttttctatcgatggatgcttgggtggttttcactctttggctattatgaatagtgtcagatatac 

ttttaatgtttatatttattgtataataacagtgcatatccgtacctcctggtgttaattgtctgtatcttaagagtttt 

aagaattat 

atggttgtgctgtaattctagcacttagggaggccgaggcgggcagatcgctttagctcaggagtttgaga 
ccaggctgggcaacatagtaaaaccctgtctctactaaagtacaaaaaattagctgggcgtggtgaagtgttcctgtagt 
cccagctacttgcaaggctgaggcacaagaatcacttgaacccaggaggcagatgttgcagcgagccaagattgcaccac 
tgtactgcagcctgagcgacagagcaagactcctgtctccaaaaaaaaaaaaaatttttctttatgatgaaacccacaag 
gctgtacaagaaccttaaatatagtttgaaacaaagaaagacacatgcttttaagtacagtttaattcttttcttttctc 
ttatttcattttttttttttttggagacagagtctcactctgtcgcccaggctggagtacagtggtgcaatcttggctcg 
ctgcaacatctgcctcctgggttcaagtgattctcgtgcctcagccttgcgagtagctgggactacaggaacacttcacc 
acgcccagctaattttttgtattttagtagagacagggttttaccatgttgcccccatcattctcagcaaactatcgcaa 
ggacaaaaaactgaacaccgcttgttctcattcataggtgcaaactgaacaatgaggacgcatggacacaggaaggggaa 
catcacacaccagggcctgttgtggggtggggggaaggggaagggatagcattaggggatatacctaatgttaaatgacg 
agttaatgggtgcagcacaccaacatggcatatgtatacatatgtaacaaacctgcatgttgtgcacatgtaccctaaaa 
cttaaagtataattaaaaaaaaaaaaagaaaaatgtgttattcctttgtctcgaaagtcccctgcttttcagcttttgct 
ttggttgtctcttgtcagctctttaatgctattgttcccttgggtttcatctggactctttttccctgggcaatctcatc 
cacaccatctgggaacattgcctttggctggacaggtaaatctttgtctctagcagacttctctcctgtgatctccagag 
ccaggaacccactagacttccttaaaattcattcagccagcagactttctggcccagagcagcagctgtgagtggacagc 
aggcacctcgttagaagcttggtgttggaggtgtgcccatgcattcctgggagccatgagaggggaggggagtgccctgt 
gtcacctgagtccaagttaagagagatcacaccagagctgattcctaaggagcatgggggcgggggaagggggtgttcgt 
ggtggggaaaggaaatgaccagccctttcaggaactaccaggagtttagcagagctggtgacagtgctatgagcaagaag 
gtggagagctggggagggccattatgaagagactggagattgcaggctgtgaacttgagtcctgatgtgactgggaattg 
ttgagatgttttaggccgaaaagtcaccgacgatcagtgtttagaaagaccccactggcagcagttgggagtgtggattg 
gaaggagccaggcctaagtcctgggaggacttaggaggtccaggagtccagggagtccaggtggaattcttgtgtagaaa 
cccagggaaggggaccatctgatgacacacatcccatagcaacttcctttcctcctctgcacctccacagcagagtgtct 
gtttccactatacgaagatttcttaatggcaaggatcgcatttttatccctcagtgtcctcagtgtcatgtgctaatcag 
ttggagacactccctaaatgtatgccagatgagtatatagttgttagtataagggctttacggtaggtatcagaactttt 
tcttttttacaacttttataacttttataaaggagccgacacctttttctgatattgtcagaacaattgctgaccagtgc 
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tatgggctggctgaggttggcaaccaagaatctgttatctcagtctacacagcgtggagttagaagattggattgttgtt 

ggttagaattttgtccagagagttagaagatagaagaaaagggagggctgaggacataaacatccaaattttacaaagaa 

gaaagttcagtgctcctgtctatattgaaatagttaagagggaaatgattgggtctggggaatgcttcaaaataatatga 

tcatagggaaagtaagtgtagtgtggatgaaacaaggttggccttgagctgataatttactaaagcaggatgatgagcgt 

gtgaggttcattatagtgctctttctactttttaaaaagcttaagattttccaaaatgaaaaagaagctgagcatggtag 

ctcacgtctgtaatcccagcactttgagagcctgagacggaaggatcacttgaagccaggagttcgagaccagcctgggc 

aatatagtgagacccccatccctactaaaaaagaaaaaaaattacacaatgaaaagaataatacaggtggtagaagaaga 

aataaaggtatgagtatatgatttggcctcctacagacagctacaggagcatgcctctgctgggcgggcctggagcatag 

tgcaccctggagctgaagttctgttcgcctttacaggaagtacacagcattgtacgggggttttaccagttaacagtgga 

cagtctctagaacactcaggggtcaggggcaaaactccctctccctcaaaccagcccacctgtcttggtgagaatggaaa 

acggagcttcccagcaagcctcctgccttcaccccgacgcacaggtgcatcagggtctgagggctcctccctgtggggcc 

tgggaagccgcccttcagtgggctatggctgcattgtcttctgatcttgtagctgtgtgtgggtaacaggcacatcgtct 

ccccatatcccagagagtcatgtccacttcctatctgtggtggcatctttccatcactgtcaccctcaggataatggcac 

aggaactaacagaggcactaagagcagtgatatatactggagggaggggcggggctgatgaccaaaatcaccttctgttt 

cagtaagaggttggttaaagtgatgcctaaaatggatagatcaggaaataacaataaaaacattaatgataaaaaggcaa 

Gtgccagaagaactgaaaagtaacctagttcaaggtgggggtccctggagagaagccttcagaagagcagaggaagagca 

gggaacattgcttttcatgactagcctttcagtgccttttgatatttcagctaagtatgtagttctctaatgatttctaa 

attgttaatgagtgtgatagaataggaattctggagaatggtcaaaaggaattgctaggagaaaattagggaaagaaagc 

acagagcagtattgaggtgggaggttgggagcagacagatggtgagtgttcgtttgcatggcaggaatgagtggctgaat 

ggcattaactggccatggggctctgaccagagcttgggtatacagtttctgtgccagcacaaaccagcattaggtgcgtg 

gtgtgtgaggccgggtgatggtcatatggatctggtccagagctgtggtcttgacatagagttccatgtatgaactttgg 

tggtggtggtggttttgtttcgttttttaagaaaaaaagaagagttgttttatgtcacagccaaactcaaggaggtgaca 

gcacaagacccaggttagaatgtacttagccagggctcacggaatcttccatcctttgcaaagccaagaagagtagaggc 

ctcaaggagaatagaggagcacatgggctccttctgggctgagctctcacccttggtggggccgctggtcccagggggtc 

gttgggagcctgcggagattctgcccagtacatacataccttgcgttttctcggagcctgacaggtgctgccttgaagcg 

gaaagacttcacctcacagggtggccagctgtcctttctcttcctctccctgtcctgtgtgctcatgataatggatagat 

agacatctccatggtagaaataatgcagccctttctgcagtccatggttccacagcatatggtcagtagacttagatgag 

tgctacataaagtgtaattagttctgcaccacggtgtcttaccttctgaaaaggaaactctggtattgacttagaaatgc 

cagttttatccttccgaccagaacgcaagtcctgaatgaggccagaggccaatgaaaggtcctttcttctctcttttcaa 

atctgctgtattaaacagtgacttttcctagtcattcagtattcattcattcatccatttattcaagtttttgagtacct 

tctttgtggatggcacatagtagatgcaaaaatacaatcatttttaaaatggaaagcatttgttatgtggtagtggaata 

taagaaatgccatgcttccatatttttatggtaatcttggactcaggttctacccaccatacttaattagaaattcattc 

tcagcctctgttttggcagagacccattctcttccctcaggaacatacagaaacctcagggctgaactgagctcagaggt 

tggggaaggtcccctggtctgatcttcagtcgctgactgctgcgcccgattccctggcccctcactcttgagggcagcat 

cctcactcttgtgtggcacaggcttggacatagaatgacacttaagcaccagagcctccctagcctgccacaaggacacc 

ctacattcccaggcactttcagaatgcaaaacctcccctccattccctccctttccctgcctccaggtttgggccctgcc 

tctccttttgtttcaaagcactgtcctctcttcctctacccttatccttccccatctccctcccactccccaaaagagcc 

aaactaattatggggatttaaccttccaattaaaggcgaggatttttttttttcaatgcagttcctgaaatgctaaagaa 

attgtccagggaaaggagcacaggaggagttttgttttgttttgttttgttttgttttgttttgttttgttttgttttga 

gacagtctcgctctgtcacccaggctggagtgcagtggtgcgatcttggcttaccgcaacctctccacctcctgggtttg 

agcaattcttgtgcctcagccccccaagtagctgggatcacaggtgcacaccaccacgccagctaatttttgtattttta 

gtagaaatggggttttgccatgttggccaggctggtttcgacctcctggcctcaagtgatccacccacctcagcctccca 

aagtactgggattaggcgtgagccactgcatccagcctgatgtttaaataatttcttacttcagagtttcctgaagtgtg 

gcctatggagactagcatcagaatcactggaggagcttaaagatatagagacagggatagggcctgagattctgactttg 

ttgtaagctatccagagtttgagagccatggcctagttcttggaatacaaaagtgtatctgctcttgtatgggaggtagg 

aagttaccttttcattcattcttctacaacagtaatttaaaagttaggaattggtatctt^catagacagaagaatgcagg 

aatcctcttctcaggatgtctgtctttacacagccatggtatatggtagaccaagttfegtccaacccatggcccgcgggc 

tacatgcagcccaggacggctttgaatgcagcccaacacaattcataaactttcttaaaacatgatttttttattttatt 

tttttttaagctcatcagcactatcgttagtgttagtgttagtgtattttatgtgtggccaaagacaattcttccagtgt 

ggcctagggaagccaaaagattggacacccctatatagaccataattatcacacgccctgagagggaagcagatcaatgg 

tgagggagcgtagaggagagcctgccagagtttccagtggggccggggatgcttaaaaacggaggcagtgatgtgctgag 

ccagaaaggacagggaggacccccatgcacacaggcagggaggctctctggtgacatctgccgtcaggtacagtagtaca 

gtagtagagtgagcggaagcacttttggctgaagtgcaacattagtgattgaggaagtgcaggtccgtaatatcttaccc 
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caaaccctggggctcactgagcttccgagttcagagtttttcagaattgatgaacacatccagaaggcccatggggcaga 

tacaccttagacatattgatgaaagtggagttggaattgggttgtggaagtcgtaggacgtgaattctaggtttcttgga 

gtgtggagtatggtgtctactgtttggaaacagagaagaatgacattatgagttttatttataaaataagaatttcttct 

ttgtattaaaatagatttttatataggaaattttcattaccttcagtatcagaaaggcccccaaatctatttcaccgttt 

ccataataaattgtgaaagaaagatttgattgggatattgtctttaaaaatgaaaccaaaaagtatgggaaaatacttta 

attttagagttaacaacatttatagttacaatcatgaagcaggtatggtgctgcttcatgagccaagttcataaaagaaa 

aaaaaatcagatcttactgagttctgtgagacagatcattatttttcttattaacatattcatttattcagcaaatgttt 

attaagtgccaattaatgtatcagtggctgctttaggcactggagatatatggtcagtgggataaggggaagtctctgtc 

ttcaatcctcacagatcttattttccataagtaagtgtacaaatacgggctatgaattaaatagagcaagataaagagat 

aggggtgtggcttcagttgtgcacatggtgcccagaaatctgatagctccatcatggagtccttcacttgagacttcaca 

gcagga^gtaaaaagcaacagccaggaaactccttagttttcccaaagtcctcccttaatctcgtttagagaattatgaa 

acacttcctatggcttcatcctttatcccttcttcagttgataactgaagggctaaatctgccaagtcttcctttgccaa 

tggttttgtgcgagcagttctccaacagcgcttccactaaaatcttgtcttttacaagatggattttacagttggttggc 

attgtatgacagtctgtgagtagggacagcccaatgtatgtttgttaatcaagtttatgccttttacattgagtaatttt 

taattacctcgtgtaatttttaacagcactctccttcatcttcagaagtgttcagtttgaacaataagtttgtcatcttg 

tcagtcaaaggttgaccaacaaagactgaccgcagtgggcggggtattcagcagggttagatgtttctaagtggtcagtg 

ctttcataaacattttcacgttatgacctttgcctcctcacacaatcttgtaactatggagacttagatgataaagattc 

aactaagaggatccctttaaagtcttgggtaccagccataccaaaagctgttttatatgaggttctttatgattataatt 

tggacatgaacatggagtggtcctgcttcagcaagaaatcaggctaataaccattagcatccagtgtgggtaaaggcggg 

ggctgccctgaggagtgttgtgtggcacagggcgcatgaacaccagataagtcctcagacagtgcccttagcatccagtg 

gtttttggcatctagcagggagtaaaagtatctcatagattctttaagtgtaaacaccgaggacacacttaacgttttaa 

aagattgaaatccagacagggGtggggtggactctaatctgtttccttctctctctttatgtgttataacttggattctt 

cattccccaaaatgaccttgtttatataattctccagtcttatctaaaacttttatgtgaggaccatggctcatattccg 

cagctttccttctctcaaggagctaggactggttattgtgataaatgaaggaaaagaaaagaaagtgggtttctgtttat 

tgtatgaaattgatgtaactttgtatttctgattataaatggtagccagagggaggacacaaaaacttcaaattccaagc 

cccataatatattatggcctgaatagtgttggaatttatttatttatggaatgtcaaaaagaaaatagggccatcaaaaa 

agcaaacagtggaagtctattttctggcccctgtgagtctggctctaaaaacagtgcaaggggttggggtttggtgggac 

taggaaagcagatgctcgtcatttaccttcctgtgtttgtcatgagaattgtctgtactgggattgtcaccttggtcagt 

aattgtttcgtttatagtaccattgtctctccaaagagggctgagttccctaaaaaaaagcaggcagcgcaccacaacat 

ctggctagtagcacgtatattatcggtgatagtaaggacacaattctcgtatttaatcaatttggaggtatgcatttttt 

cgtgggctaatatccctgaaatcatgatgcatcttatggtcagtgacttaaaaagcattatgtcatagttaaattgacag 

tgtttttcctaaattatacacagaataatgcctttttcagccaatgttattttagagttgatgaaatatgttgcattttt 

atattcttggtatgaagactattaactgtataataaaattatttaagttctgatgacgtagtgtcagatgatgttcagtg 

tagcaaaacgcgaggcacaattttttgtaaaaaggacaatatcttgcagtcagtacacactgagtacataaatgttgacc 

aaatgaagagcacagaatggggctgcccacctggaggcagagccatgaattggtactgaggcaaacaaggagagctttgt 

gtgtgggtggaggcgcagggagggagcgccaggcttcatggaggcatgcgagggtctccagggtcaaacagacagaacct 

cagaactggtgtgaagcttagaaggggctgcaggaggctgtgctaaagcctcagtgtattgctatgggactggcaactaa 

acaagtaaactaaaggggtgaggtacttttccttaagtgaaaaaagtttccttaacttctttaaagtgtaagctctgttt 

tcagcttaactcttgaaatgagaagcgtgaatccgtggtggtttcatgacctgcataatcacactacctgtttttctgtc 

acagattgccttctctcagcacagcaattttatggacctggtacagttcttcgtgactttcttcaggtaattttgcttat 

acagactctctctgctcttttgatttaggcactagatccattaatagagtatcagctcctaaaagataaaaagaataatc 

ctgtgtttccaccacagattaaatttacttcatatgtgttcctgaaaagttagatgtcagttacagtaatataaactcaa 

tctagcatgacaaagtgaaaacgtttggtttgtatgttctggcttccagcggggggctttgcatgtatcctgtattaatt 

ttgcatgcatcctatagataatttggctcttcatatggggagaatttggactaaaacaacatttaattaataatggttac 

tgatgtgagctacacatatgcttatgcttgttcaagagatgattctggtgattatccaaagttttaaagctgaaagtgat 

tttttagtgccctgggattttcttttatttccatagtttttttcgtagtctaattttatctttatgaaaatactatagac 

aaataattgaaaaagacaaaaaacgtgctgcagtttttaaaacagaaagcagcaatagcctgccttgcgccttccccacc 

ccacgtcccccaatttaaattgttttgttttttacttctctgtgtttacacttggtttttcagttaagttctcttttgat 

gtcctactgtagaagatgaggatatactgtgttacatggcctcccctagcatctcccctcacaccacaaaagcatacaaa 

agccccatctctttcatttaataccattgtatcataatgcatggctaaatcagtgatccctatctatgttatggccataa 

aactcttattcacagctgcattgaacagtgattacactaccacattttttgggtagaaggatcattctttgcttatccta 

aatgtagtatttgccatgttttggctttttaattgtttgatttttctatgtgcctacccctgatttatcactatactctt 

gctgatgtacctttagaacagtaaaaaataaaataaaataaagtatcattgtactgtgacagaactgagaaatctcacaa 
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tggtcacacacaggaaggtgctctcttggtttgaatttctctcggagttgcccaccctagaaccttcccacctcctgccc 

ccaagtgcacttgttcctgaggcctgctgcacaactgtcatttggaaacttcctttgccttcctcctatattgaagtctc 

tgtttcccagatcccatgtcttcaccttcctatccattccctctttttactggagcacattctctagatttatgagaaaa 

gatgcctgggagataaettttctgtgagctetectgactgaaaatatatttattctatcctacaatttgattgaatttct 

gggggcaggtgctatggctcacacctgtatttctagcactttgggagaccaaagtgggtgggtcacttgagctcaggagt 

tcgagaccagcctggacaacatggcaaaaccttgtctctacaaaaaatacaaaaattagctgggtgaggtggcgcaagcc 

tgtagtcccagctaatcaggaggctgaggcaggagaatggcttgagcctaggaggcagaggttgcagtgagcggagattg 

caccactgcactccatcctgggtgacagagtgagaccctgtctcaaaaaaaaaaaaaaaaaaagtctgactacagaattc 

taggttcaaaatcattttccattagaatttagaagacgttatactattttttctagtgtccagtataatattgagaaggc 

tagtgctgttataattcccactctttgtatgggatccgttttctctcttaaaaagaaaagtatatttaagtgtaatataa 

agttacagaaacatttgcaaatcctatgtgtgtagctcagtgaattatcacaaattgaacacttctagagtattgataat 

attatctcttgttcacccagaacacttcaggcttcttatttcttctctgtcagtttcggcaagttgtatatagttttctt 

ctaatgatcccattccatctgttccattctcttgttcactgagaacaagagttaatattaccaatactctagaagttcct 

cttgtgtttcccctcactcacaacctcctgcctcttccccaaaggtaccatttacctgacttctcaaaccatacattgca 

tttgctatttttgaaatgcatatgaattatttctttttttttttttttttttgaaacttagtctctctctgtcacccagg 

ccagagtgcagtgttgtgatcttggctcactgcaagcaaccgccgcctcccgggttcaagcaattctcctgcctcagcct 

cccaagtagctgggattacaggcacgtgccaccacgcatggctaatttttgtatttttagcagagacggggtttcaccag 

gttggccaggctggtctcgaactcctgacctcaagtgatccacccgcctctgcctcccaaagtgctgggattacaagcat 

gaaccaccgcgctcagtcaaaatgtatataaatgaaatcacaacataaattttaagatgatttggttgatgaagaaaata 

ctgaaaacttgtgggttgcagttaaagttgtgcttcgaggttatagctgtaaatctgtggctctccaccttagctgcacc 

tagaatcacttgcggagcttttgatgtccaagccactcttcagaccatttgactcagaacctctaggggtgcaccccaaa 

catttgaaaactctccaggtgattccaatgtgcagctaaggttgagaatcactgctttaaaagcacatattagaaaagaa 

aaaacattcctctcaataactcagaactgctaattaaacctaaagaagtagaagaaagaaaaatataaataagagcaaaa 

attaatgatctaagggaagaagtacagtagagaggatcttaaaaactgtcagttggatctttgaaaagagtaagaataat 

attgacaaacccctggcaagattaaccaaggaaaaaagagagaaggtatatggtatatgtagccaatgtcagaaataaaa 

aagggggcatccctccaaatccagcagacgtttttttcct'ccagaagggtatttgatgaacaactctatgtgaatgaatt 

tgaaaatgaaacagatggaatgggaccattaaaagaaaaatatgcaacttgccaaaactgacacagaagaagtaaggagc 

ctgaatagttctgtatccatttttaaaacaacactaatttaagttttgtcatagtgaaaactctaggccccaggtggctt 

cattggtaacataacctttttaggccgggtgcagtggctcacgcctgtaatcccagcactttgggaggccgaggtgggtg 

gatcacgaggtcaggagatcgagaccatcctggctaacacggtgaaaccccatctctactaaaaatacaaaaaattagcc 

aggcgcggtggtgggcgcctataatcccagctactcgggaggctgagacaggagaatggcgtgaacctgggaggcagagc 

ttgctgtgagccgagatcgcgccactgcactccaacctgggcaacagtgcgagactccatctcaaaaaacaaaaaggaga 

atactgtgatcatcttaatcaatggaggagataaagcatttgataaaatttaatacccattcatttttaaaaaaatttct 

tagaaaactagaaagagaaacttctggaacactgataatggaacaacataaaaagtcaacagcaaaccactttcttaatg 

gtagaatgttagaagtttgagtttgggaacaagacagagatccactttccacagtttcaattactcagggtcaaccgtgg 

tccaaaaacattaagtggaaaattccagaaataaacaatttataagtcttaaattgtactccattctgagtagtgtgatg 

aaatgtcgtgccatcctgcttcgtcccacctgggataggaatcatcccttggtccagtgtatccacgcccctatgctacc 

cacccactagtgtcttggttatcagatctaaaaatcatagtatatgtagggttcagtactatccatggtttcacgcatcc 

actgggaggctgggaatgtaaccccacagattaagcaggacactatattgccaaatccaaaggatactttttggttctcc 

tcttcttgacttctcggtgatatcaaccaactgactgcttcatgagcgagactcttccagccgtctctgacccatgctgc 

cctactgctgcttcttagcctgcctattatgcaatcttttctattcagcctctcactgttgattttttcccagaccttga 

ccctggccgtcttctcactctgtgttctctgactggactctgccttccccatctgcagcacgacatgcccagaagtgacc 

aattccatccttcctcaccggtgttccccacctcactaactggcaacatggctctgctggttccaagccacttctcttgc 

tttcactcagcaggcactcccttaccaggccccagagctgtggccgtcagtgcactctccacctccactccccaccccac 

caaccctagcatcacagtctcgcctggaccttttcacttgactttggcccctccagaatccacaaaaagtccagagaaga 

cctttaaagatcatgtatctagccccttaccttcctgttgtatctaaaataaaattcagcgtcctgggtgaactggcgtc 

cgcttttccatattcagcttgcactcttctcttccacatttctgctaccaggaccatctcctatgtcttcaaaagcacaa 

gctctttccctacactgagcatttgtacatgcttatcccattttctggaacactcttctcactacccttcattctgtata 

ctttcataccttgcttgatttttccagacaacaacaaattctccaattctctgacaccaactcagtgtctcacagttcag 

ttccattcagacactacccatagttggagcaaatcctgcaggttaggggctcagtcccacaagactgctcccacttcaga 

cactaagcacaaatgggtccccaggctacctcactgctactgggccaactacaaattcagggcttctggtgactgctcac 

tccaagatttgacaatttgttagaacagctcacagaactcaggagagcactatacgtagaattacagttttattatacag 

ggtacaactcaggaaccgccaaatggaagacatgaataggacaaggcatgggggtgggtgagcacagctctttcatgccg 
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tctctggacaagcctcctccctgcacgtggatatattcactatcctggaagctcccccaagcctcatcattctgagtttt 

tttattgaggtttcattacataggcatgatggatgaaatcactttctattggtgactgaactcatctccagcccctttcc 

ccttcttggaggttgggggatggagctgaaagttctaaccctctaaccacctagttgggttttctggtgaccagcccccg 

catctgaggctctttaggcccctctccccatgagtcattcattagcacacaaaagacactcctgtccctctgaaaattcc 

aaggggtaagctaaattgtctccccttgttcctcactcctgtagaactgtattatttttcatcgtggtattcatcacaat 

ttgaagaatatattcatatgtttacttgcttaatatctgggtcggtagcccaagcttcaggggagcagtgcactgtctct 

cgttcactcctatacacacacagcttcagcatctagtacataatctagtaagatatttgtgggaagtaaacttagactct 

tcagggtaagtgcagagtgaggaggaaccataggcaccaacctctcagggaaaaaagcttgtgtcaatggcttgctttta 

agggatgtaggggctgtattgtagctccagtagacatccttaaattcttggagcatgtgtatggaataagatggccctgc 

tgccaagtacgtacagcctggtgaggaccggccacttctactggacattcagtccctgggcgaccgaatgacactatcag 

aataagtcgaggaagtacttttcaggcccatggcctcttgtgtagtaaacaaggacagatggcattttcatcacttctcc 

cttgtgggggttgaacttctggaaaagaatccggaactttggtttttctagacgatgtttactaactagcgatgataaac 

ttttagctaagaatagagttactttcatggcatggggaagaaaatattttcccaggacaagttaaccatgtctaaaagaa 

tttatttagctttttttttttttttttttttttttttgggatagggtctctgttgcccaggctggaacacagtggtgcag 

tcatagcccactgcagcctcaaactcctgagctggtctcaagcagtcctcctcacctcagtctcccaagtagagctggga 

ctgcaggcatgcaccatcatgcccagctaatattttcttttttgtagagttgggttctcactatattgcccaggctggtc 

ttaaacttctggcctcaagcagtcctcccatctcagcctcccaaggtgctgggattaccagcgtgagccactgctcccag 

cctggtccaaaagaatttaaaatggagcctataaggaaaaggggaaaggtatccagataatcctaagcactctgaagaat 

ggctcatataatatatggaaaaaagaacaaaagtcggggggaaattcgatgattcaaatgagaaaatcaatgtaaagtga 

gttagaaaaaatatttccagactccgggcacagaaaagtaaactctaggtcctgacacagtctgttgagactgtgtccta 

atgagatgcttttccagcctcaggaaggagcctccttaaagagggaaataagagactctcctctgcttgcttttctgtag 

cttcttgtagctggagcagcaagtagaagggactggggaactggtgttgtcaacagatcttcactgaatacagaagggcc 

ctctcaccttccctgaggcatagagggaagcaggcacagactgtattacgggtggacccaagagaggtttctgagtgaca 

ccagggtcctcgcctctcagcagcagcggcaggctcttgaccaacttagaagctgttgccgttggcttcatgctcttcct 

tgtggatttgctgcctccttatgtttatctattagaataaaaataacaaaaatcaggattatgcgtggaagccaagaaag 

actgagaaggttattctcagacctctttactggttcttcctgttctcctcaacttctcattgttagggtggccccaggac 

attctcatctatctgtattcactctcttggtgaatgtggccaggtttattgcttgaaataccatatatatccttatgcct 

tccaaatgccatgcccagcccagacctggcagccagcctccaggttagaatatctgcccaggacatctcccctgggctgt 

ccggtagatatctcagggttctgcatgtctaaaatggatctactgatatgtccagattccaaagtgcttttcacacctct 

gcttccacagctctggtttgagccaccatcatctcacctagactgctggtctccctgataatgccattgcacctctaatg 

gttttctcagcaaacgttaaatcacaacactctgtttaaaaacctgcactggctctttcccaatgccctcataatgcttg 

cagtgccccacatgatctgcccttcaccctcacccttctctctgcctttgtctctggccttacacaccccactccaaaca 

tgctggcctcctgaggctccccaagccatcccaggcacggccttgcctcgggctttgcactgactgctcctttttccaga 

cgcacttttgcaggcgtccatgagccacctcccataccttctccaagtctttgttcaaatgtcaccttctgagaaagaaa 

ggcccacacactctgaccatcctgctgtggagtgccactggtactcctgaaccccttgctcttagcatctttttatttct 

gtagcacttacctaccttttggcatgctctgtaatttgcttatttacttactgtttgtctccacctgctggaatgtaaat 

tccatgggacagtaatctttgctctgtttatcagtgttttccatgagcctagaacaggacctgacccaattaaagtattg 

attaaaaagacaatgtaggccagtcacgtgaggcctgtaatctttgccaaggcaggcagatcacttgaggttaggagttt 

gagaccagactggccaacgtggtgaaacctcgtctccactaaaaatacaaaaattagctgggtgtgatggcgcacacctg 

tagtccaagctactcaggaggctaaaagaggagaatagaatcgcttgaacctgggaggcagaggttgcagtgagctgaga 

tcgtgccactgcactccagcctgggtgacagagcgagactgtgtctcaaaaaaaagacaatgtagactgtcagaagagtt 

ggcagtgtcctgatagcagagaggcccagagatgaaggctttccaccaagtgacgatcctgggggggttggaaggcaatc 

ctattgaattgcctgttgtcagggcacccattggaggagtctgcacccttgtaccccttagggaggaggcagtgagcctt 

gagataagggcagttcagtcatcctagcatcttagagtccctctctccagccagcccaggagggctgtgttggtgtcagc 

agttgggtgcactgaggtgccattcatgaagctgaccccatcctacaaacgcagagctcagccagcaagagagtaccttg 

atgacactggcagagagccttatgtctacatgtagcactctggaaaaataccctgtagctagccccttttacaagttaat 

gagtaaaaatgcagttatagcagttatctatttttttacttgttaagcttattcttccttttaaccgcaagtcagctgtt 

ctctttgccatcactgtagcttttcaaatcattccacacctggtttatacatgagagagcattgaaatcctctgcaacta 

ttaaggccactgggagttattttctcttaatatttttaagaaagttgttggcttgggtttgatcatttcaaatgtgtttt 

attttacttaaaaatatgtatttgtaaaaatattgaatatacactaataattacacacatttcttctgtttgcctcttcc 

ttatgaaaagttgattgttactttaataattaactttttcaccttcttagattttaccagtaaatatctgctaacttgaa 

taaacagcatgactaatatttgttataattatcttagaattatttttcttatttgggataagccactgtgggccattctg 

atgaaggccaagtggaaggtcagggctctctctgaataccacatgggaacgatacagtctagaggcctggttaggaagcc 
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tggcttaccgaagttgtctcactcattacctcatgagccggcacattgttgtgacaaagcatcagaagggagcccggagg 

tggcatgtgggagctgttttcactgcctgggcctgaggtgtgggcaacggtgggagctgattctgggggctgggataata 

actcagtactttcctttctcacagttgtttcctctctttgctcctggtggctgctgtggtttggaagatcaaacaaagtt 

gttgggcctccagacgtagagaggtaagcttcagtgggtaaagattaaagaatccctggaagagctttttttccttcttt 

ttctcttaagcaagtgggttttagctatttagtgataatggacagacagacatctccacggaaggaataatgcagccctt 

tctgcagtccatggttctgcagcatatggtcagtagacttagatgagtgctacataaagtgtaattagttctgcaccaca 

gtgtcttaccttccgaaaaggaaactctggtattgacttggaaatgccagttttatccttcagaccagaacccaagtcct 

gactgaggccagagatccaagggcgtattagcacagccaccgcgtgattaggcaccgccttcatgagaacttgtcatgcg 

agagggaaatcagccatttaagcatctcaaaaatttttcattattcaaggaaagataaatgtgtgtgtagttaagttttt 

aacttgagttttatttttaaataagtcttgatttgtttccagagcttcattccaaaagtaataagcaaatatttacagct 

gacaaatttaaataaataaacttggttgcaaattacaacatatcaaatgcccatcaatcaatgagtggataaagaaaatg 

tggggtgtgtatgtgcgtgtgtatgtatatatatatatttatatacatttgcagcaacctggatggaattggagaccgtt 

attctaagtgaaataattcaggaatggaaaaccacacttcatatattctcacttacaagtgggaactaagccctgaggat 

gcaaaggcataagaatgatacagtggattttgaggactcgggggaaagagtgggagggggatgagggataaaatactata 

catcgggtacagtgtacactgctcgggtgataggtgcaccagaatctcagaaatcaccatgaaagaacttattcatgtaa 

ccaaacaccacctgttcccccaaaaacctattgaaataataaataaataaataaataaataaataaataaatattttaaa 

aataaccaccattaaaataattacaacataaaaactttgtgtattgcaactttcacttgcatcattactttgcctttcat 

ttagtcatgcctgtggcattgttggcattgttgctaatggaaatgctcacgtgtatataagatctagacaaagaagctgt 

ctggggtttttttggatctactcatgcctttttctatataaaaatgtatttatagataagatttagaacagaagggaagt 

atgtaaaatcacaactcacaagcactcagcactgttgataggatattcatgtctgaatagggttaagacaggctccagga 

tatgggcctcttgttgtgggccacagtaccaccttttctgacccacataaagatgatgttatcatagagggaatgttgca 

cactctttattattattttttaaataagtggataacatattcaaagggttttttagataactgtattactgatatacttt 

aaaagcctaatgaattacgtgatgttctgaaaaaaattatattttcaatttaatttaaatgtatatttattcacaatgga 

gtaagaaagaagtaagaagccacagtaaaccagttatcaagaaaggagttgaaaacttttgagtg'ctactctctccaaga 

aggacattagacccagggagtttcaaggttgagatgagcttatggagatcatgatagcctgttcctccctgagtgcttat 

gagaaataattgtttctaaaatctgtaaagcaaaaggacagatggaaagcttgaaagagtccgaattgattttatacaca 

taaatcttggtaccaaaacctgacaaaggtagcaccaaaatacaaagggcagctgcatagtgaatgtatgtgcagcagtc 

taattgcagtgtaatgaaagagtcacttaataccagaaaacctattagaatatatcatgtcaaaaaaagaatatatcatg 

tcaatggggaaaaaataacacctgaattcttcctagtggcccagaaatcatttcgtcatttattgaaactcaatatccat 

tcctaatttgaaaacaaaactcacagtaaaatcaaaaatacctttttaacgctatgaagaatatctggataaccagcatc 

actcatgataaaacaaacaaacaccagaggttgtatccaggctagttagttttaacaagagctgacatctgagcatttaa 

aaattccaggcactgagttctaagtggttttacgtagattaactaatccttatctcaaacctatgaaataggtattgtcc 

ctcgttatgttttatggatgaggggactgaggctcagaaaggtgaagtgacttacaccacatagttaactaagaggcaga 

cgaggctttaactagggacttctgtcttcacagcccaagttcgttatcttcgtgtaatcctacagtcattcagctttgtg 

ctagatgttctagactgtgcactaagtgaaagggcactaagaagtgttgttactattggaaaggaaacaaaattattatt 

ttcatataaagttattaaaacaatgtagtagagatttgcctgcataaaaatgaaggcttaagtatatcaaaaaataaatg 

aataacacagaagctaaacaatcaactggaaattgtttgcagtgtgcatgagagaaagtaagatgcacagcctacaggaa 

aaagattaaaatatatatatgtgtttagtatggagttggtcatctaataaactatgcttttacccgtgtgatttggaatt 

tctctgtaactgtgaagctgaccattcagaatatatttcatcatttttaaaatgtcagccattatttattccctctcaga 

attgtagaagcaatgatttgatgttatttttaaaaacaatagtagtagcggctgacgctgtttagtccttgctacacacc 

agccccggttttaagtgcttttacacatgttgcctcctgaaccctgtcttattgtgaccccagcacttaagcacagaccc 

caaggagcattgctgcatatattagtcgagtaaatggacctacagctttaaaaaatgtaagaatgttggtattttaagaa 

aatatttataaaaatgcccttctagaacagagccatatctatccaggccaagattctgtcttctacagtagttccaagga 

ggcagtttgaagtaaggcctacttgtatcccaaggcataacctcaaagcttcagcatcatgcccagaaagatctctaata 

tccttaatgatatgtgaacggctgttcaccatgtgctaggcgctgtgctgtagtatctcatttgatcccagtgacaacct 

tatgggctaggaacacatttcagagggaggacacggatgcccagagacttcaaacacctttcttaaggtcacacagctgt 

agactattggagcaggagttcacatcaagcctccctaaatcccgagcccctgtttgtccctctactctgcaaaatttcac 

atttgtttgtttttagcttataccacctctctgaaggaagggttctttcagttcctatctgccgtataatgaggcatagg 

tttttgttgtcctgtgaatccttggtgccagtttcagtaggaagaaagcaagcctgacttbagtagtagaaatacgagga 

gatagggccacaccattttatgtatttgacaagtcgacgtcatagcattaaatccacagtacttttatttttatctttat 

tttcttattttactttaagttctgggatacatgtacagaacatgtagatttgttacataggtatacatgtgccgtggtgg 

tttgccgcacctatcagcctgttatctaggttttaagccctgcatgcatttggtatttgtcctaatgctctccctcccct 

tgctccccacccctgacaggccccagtgtgtgatgttcccttccctgtgtccatgtgttctcatggttcaactcccactt 
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aagagtgaggacatggagtatttggttttctgttcctgtgttggtttgctgagactgatggcttccagcttcatccgtgt - 

ccctgcaaaggatatgaactcatttattttatggccgcatagtattccatggtgtatatgtgccatgttttctttatcca 

gtctatcattgatgggcatttgggttggttctaagtcttcgctactgtaaatagtgctgcaataaacatcgtacgttaat 

atgtatctctgacaggggcttttaaaaaatataaggacaatggtgttatcacatccaacaaaatttagtaatccctaatg 

ttatctaatatcaagtctaaaaagtatcattttggacttgatttgtttgaatcagggaggatgaggtgtttttaacatgt 

agcattaactaataaaaccccttcagtactcattccattaatggctttgccatattatttatcagcaacttcttcgagag 

atgcaacagatggccagccgtccctttgcctctgtaaatgtcgccttggaaacagatgaggagcctcctgatcttattgg 

aqggagtataaaggtgagaatgtgactcagaagtccctataacttgactttttaaaacttaggctcctaagtctgggaaa 

ccagagagagcaaaagccctattccattacctcctttttttcctctctttctttttaacaataagctaaaacctgaagtt 

gctgggtactcttttgcttttgtttttcaatctttgtttcatgtcgctgagcagtcttgtgttcaggggattttaaaatc 

taaatgatttgtctgcttttcttttaagtattctgtggcatacaaaacttttatttgaaaagagctacattgaacatcga 

atcattgtttttttgtttgtttgtttgtttgagacagagtcttactgtgtcacacaggctggagcgcagtggcatgatct 

cagctcactgcaatctacctcccgggttcaagcgattctcttgccccagcctcctgagttgctgggattaccgatgtgcg 

caaccacgcccagctgatttttgtatttttagtggagatggggtttcaccatgttggccaggctggtctcgaactcctga 

cttcaagcaatccacccacctcggcctcccaaagtgctgggattacaggtgtgaaccaccgcacccagcctatttttctt 

tatattttggggaattttttaagcagcaaaatgataatacactgtgtattaatactctgtggaaaatatttatcaccccc 

ctccccaaaatgatgaagaaaatggtaagactttctccttgggcaaaatgatggaatatttaagatacgctggagaaata 

gctatgtatcttgaataaaacatacttacttaaaatgttactgagctcacaggaagtaggtaaaatctccagggaatact 

ctccttccctaccctaaaaagaacaaaccataaaccaaaaccagaaccatgatttgtcctaagtggttccagatggtaaa 

cacaacgcccacctgaatagatggaaatcctctctatagaaaaacatcttcaattcagcgctgttgaagtcccacagatc. 

aagatgaagtaattaaagatcaccagggacaagaggcaagccgccataaataagagtagcagaaacaacagatgattgat 

gtggacaggactcbaaatgttagaattgaaatctaaagaatataatataactgtgtatgaaatgtttaaagaaataaaag 

atgaaatcataaagatgagagtcaagtatggccaagcagatctgaagaagaaccaaatggaacttctagaaatgaaaaat 

atacttgttgaaattttttaaataaataaactcaatttacatgttaaacagcatatcagacataagagaatttacaaagt 

ggaagatgaatctgaaaaaagtacgcagaatgcagctcagagagacaaggagatagaaagtacaatagatactaagaata 

tggaggatagaatgagaagatccaactcatatatatttccagaatcccagaacaaaggagaggcaatattcagaaatgat 

agctttccagaactgatggaaaacatgaatatacagattccagaagcaaaacatattgtaagcatgattaaagaaagaaa 

gggaaggcaggaaggaaagaaaacaagttggatttctcataatgaaattgtataactccaaagacaaagagaacatctta 

aaatcagccagagaaaaagtatcacatacaacgtggagaaactgaacgttagtgcactgttgatggaagtataaagtgat 

ggagccactgtggaaaacagtaccactattccccccagaattaaaagtagaattatatatgatctggcaatctgggttta 

tacatagaagaactgaaagcagagtctcaaagacatatttgaacatcagtgttcttggcagcattattcacagtagccaa 

aaagtggaagcaggcccaagtgtccatggatggataaatggataagcacaacgtggtctatgcatccaatggaatgttat 

ccagccttccaaaggaagcaaattctgacccatgctaeaatgtgatgaaccttgaagacatacrtaagtgaaatactccag 

taacaaaaaaataaacattgtatgattacacttttaagagatacacttgtggtagtcaagctcataaagacaggaagtag 

aacagtggttcacaggggctgggagaaggggaaaatggggagttagtgtttaatgggtacgaagtttgagttttacaaga 

tgaaaagagttctggagatggatagtaatgatggttgcacaacaatgtgaatatacttaataccactgaactgtacattt 

ttaaatggtcaagatggtaaatgttatgtgtattttatcacaatttttgttaaaaatgggaaaagaggccgggtgtggtg 

gctaacacctgtaatccgagcactttgggaggccgaggcgggcggatcacttgaggtcaggagttcaagaccagcctggc 

caacatggtgaaaccccatctctactaaaaatacagaaattagccaggcatgatggcacacatctgtaatcccagctact 

tgggaggctgaggcatgagaatcatttgaacctagaaggcagagttcacagtgagctgagattgcaccactgtactccag 

cctgggcgacagaacaagactctgtcgcaaaaaaaaaaaaaaaaaaaaaaaaaagggaaaagatcacctgcaaaggaatg 

acaagctgaccgctaacttctcagaagtcagaagagagtgagagaatgtctttgaagtactgaaagaaacctccccacct 

agaatagtgtgcccagtaagcaccttccgagaacaaggatgaaataaagatatcttcaaattagaataaaaattgagaat 

ttgccaccagcagacctgcacaaaagaaatgtctgaaggatgtgctaaaatgaaggaaaattacccccaggaggatagac 

taaagtgtaagaaagaatagtgagtaaataaaagtataatcatatatagaaatctaaacaaacattatcctggtaaaata 

atatttgtgggttaaaaaatagacaagcctaaagtattgaacaacatgatatatgtcagcagcatatgacacagttagtt 

gctccttaaaacattttcttgattttcttccaggacaccacacacttgccttctagctgttcttgctcagtggtctttgc 

tcagtcctcttcatcttttgacagtggtgtgcgtcaggccctgcctttgtccccttcttgtgtcttttacacccacttct 

tcagtgatcttcactatcacaggtttaagtaccatgatcctctcacctggactgctcccatgagcctccaacacaaacat 

cagttgcctactcagtgtccccacttggatatcttaacatggccatggccaaactgagctttggatgtttatttctcaga 

cttgctcctctgacataccttccgcctttcagcacataacacctccatcaccctagttgctcaggccaaaatcgtggaat 

tgttcttgactcattcttgatgtcatgttctcattggacagccagtctatcagcaaatcttgactccgtattc'aaaaatg 

tttcccagccactgtgtcccatctctgccactattcccacttgatctaggccaccgctgtctctcacctggattcttgca 
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gttacttcataactgatctccctgcctctgccattcacccatctagtcagttccttgtgccacagcagagtggtactgga 

aattattttcttcagccaaactgaagaaactgatgacatttgtgggtatagtcagcagaattctaaaggtggccctcaag 

atttccatcccacgtttattcagtcacacactactctaggtactgctagagggattttgcagatgtaatgaaacaaggga 

gttgactttaagatacagaaagtttctctggctggtggcagaaagaggaagccagagagattggaaacatgagaaggact 

tgacacaccatccctggtttagagatggagtggccacagaagtttttttgtttttgtgagacagagtctggctgtgtcac 

ccacactggagtgcagtggcatgatcatggctcactgtagcctcaaactcttgggctcaagggatcctccctcctgggtc 

tcccaaagtgctgggattacaggtgtgagccactgcgtaagccctgaatgcatttcaaagcagagtcttccccagagtct 

ccaactaagagtcccatccagctgacaacttgacttcagcctgagagacctcaggagatagctgataattccccagaact 

catggaagacatgaacagacctgtctctctcggcagagaaccctacagagcttgctgggacttctgacctctaaagctgt 

gagataataagtgggtgttgttttaagcaactaagtttgcagtagtttgttacactaatataagctgggtgcggtaggtc 

acacctttaatcccagcactttgggaggctgaggtgggaggatcgcttgagcccaggagttcaagacctgcctggacaac 

atagtgagaccccatttctattctttaaaaaaaaaaaggaaaggaaagaaaaagagaagaaactaatataggggacaaat 

ctcatctaaaggctgcaacagaagatgaaaacatattttccagagccaaatgcatccagagagtagacaaagaatctact 

tggtcagccaacccagcttttacaagcttttacatccagcaggcttgtctacttagtgtctaaaggacagacaaaataga 

atgttttttaagtctccatgatatctgcatctgtgtttgagccaggtatgcagaactatctgacaaatctaccaagcact 

caccactgccattcctgtgacctacagttatgaattagaatttcaaatcgagccaaagtacagcatccagaagaaaccca 

cttggcgtggccccttccctacagttcaagcagccctctggaaggggactgcgagacctcagtggaaggtagcaagcagt 

accacccgtgtcactggtgctcagagagtactgttagctctctcctgccctttttcattatcttaaaaactgcaatgtac 

cttttttatatacctgaaattgatggttcttataagcacaaatgaccgttaagttgtccgtatacattaaagcttgagta 

tcacttaaggtaattttgttttcttctgccagactgttcccaaacccattgcactggagccgtgttttggcaacaaagcc 

gctgtcctctctgtgtttgtgaggctccctcgaggcctgggtggcatccctcctcctgggcagtcaggtgagtagatgcg 

gtccagcgaaagacaccttctaagcatgtgagggagctaagcatgggatacttccctttcctagagaacaaggttataaa 

ggbgataaccaacagtgcctgcccatgccatggcaaggattcagtgagataacccattttcacctatatgcccaaaactg 

ttttcccgaggtcatcgtcccttcctggggggacgtcccgggacctccbcaccgttctggcctccccagcctctcagtgc 

tcgtggatgcctctgaccacaccttcctttgctgtcggagtttctggtccccctttgcctctgctgctctttccctgtct 

cttttgccctgtcctgcccctgaatctgggtgtgctcccttcaccctgcatgttcttcctcagttgccttctccactcca 

caacttccagccatctcccctgggctggtgtttcttcaattggtatttctagccctcctgcatttcttttgctttttcac 

tgttttaaatataaggcacaaacacaacgataacataaaggaaacgtctgaaaatccacgcacagttctgctatcccaca 

acccaccaattttcattgttctgtgttctttccagccctggccagcaatttagttgaaatgattgtatagatctaatttt 

tattctgcttttttcatataactatcttaaacatttttatagcctactacataatattttaccaatgcatgtgacacttc 

tccatcaaatatatgaatcatgattatataaacttctattttgaagacagaaaatatggaatacagttttgaaagtatct 

cgcttaattacagagtgctttggaggatgcctcaatgtgatcattttagacaactttcagttgagatgattctggagtca 

gggttaccttagccagagagcaccctgtgtctggtggaaaggctgccccaggatactggtccaggccctgagctccactg 

tcccttcagtgcaccctcagacccagaagatcctccagcccagctttagaatggttaggatctatagtatcttcacgagg 

gaggcctttcgacatggctcggccaggattcattgtcttcatctcacctagggattcctgtgagttgagcccctaactcg 

gccttgcctgtgttgaggacaggacaactctctcgtctgtcactcaagtgcagtttgtgtctcctgccaccgcccagccc 

agcaaacagtggctggtgtgccctgagggcagggtggcggctggggcggcaaggtgacaggtgtgcagccctgtagcctt 

tcctgctccagcctttgcctgacatgccccaaccacctgaaacacctaataaaggtctgctcaatgcttgagacccaccc 

aacagtggggccgtccttgctgccccttctgtaccccctgttttaagtggtgcccttttctaggctcccagagcattccc 

cactgtgtgccaggcacttacagctgtctacccccacccccactccccttcacaggctcaacaaggcaagtaacacgctc 

cactaacacaactcctggcaggtctcagcagagctccgagcctgcgctggccatttcccacgtatattaggaatccagct 

cccccagaaccgtgccttccctaatggatgtcagattactccgttatcaatacaggaaggaaaatgggattgcaagtccc 

caaagcaaagtgggcggcggggcagtgatgagcataaagggtagttgctattgatcgtgggttgctgcgcgccagatact 

aagactttatgcatgccaggtactaagactttgtcagttcactcattgattcttcacactagccctgtactactattatc 

cccattttacaaatgaggaaactgaggtgcagggaggttaaagaaccttgcttattccaaagcctggccccagagcctgc 

cctgcctgtatcgtggtatggaatggaagtgactaatgctgactgagtgataggggtgaat,ctgaatctcgtcataaagc 

gtgctgctatttctcggctccttttggaggaaggcaggagaaattgggcacttagcacacgtctacaatctcatttatat 

gttcccaccaccaagcagagcactgtgcttggttgatgtgctgaaaagcaaactgcaccttttgtgtaaaaagtagggaa 

aggagaagaatgtgtatttgtatttgcataagtataccctcaaagaatacataaaaattagtgaaagcgtttacccagat 

ggaggggaaggcggagggagggtcagggtggaaggaagacttttccctgtgtatgtgtacctttttgggggttttttgtt 

ctggttttttggtttgtttgttttgcaccatgggattgaaaaataaatgtttaaattattttttaaaacaaaataaaagc 

cagctgctcccagcatatgttctctctgtggttctcccaaggtcttgctgtggccagcgccctggtggacatttctcagc 

agatgccgatagtgtacaaggagaagtcaggagccgtgagaaaccggaagcagcagccccctgcacagcctgggacctgc 
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atctgatgctggggccagggactctcccacgcacgagctagtgagtggcacaccagagccatctgcagggaagggcgtgg 
cggggaaatggctgtgcggtgcgggacggaagactggaaaccctcaaagcatctgactcacctgcatgatcacaagcttt 
ctttgacggtttctcccatccgtgttccagcatctaaccttttacttttgcataggaaatacttgatttaattacaggtc 
cagggatgagctgatggttgctggaggaggccagtgtagagccagtgagagaactaggaatgacactcaggttcactgtg 
gaaaactgttcttgggactgtctcaactgtgcaaaaaacaaaagatggagtgtttacaagtagacattcgtcatcagttg 
ttcttgaacatggtcttttaaaaactagtcagatgaattaacttgttttcatctgaagcctgctatcttttttaaaagat 
gtgctatttattcttgcacgatttaggcaattatctctcttccagggagtacctttttttctagttgagaattaataatg 
gtccatctcttttgatcatatcaagctaggatagaaggggggctattttaaatgtcaaggtcagcagtgttactttgaat 
gtaaactggtataataggtagttttctatagtaacttgattaatttagtcttaatccatttgaaactctctcttcctttc 
tctctgcctgtccctctccttctccatctcaccctccctctctcacacatacacacacaaacacatacacacaacactaa 
gtgcctagactttaaatagatctagcaattggaaagttagtaagcctaagtttttacataattgcattcctacattcttg 
taaaatttaaatagctaccattggcaatctgctttttttctaaaatctgatttgcagccaggaaagaattttctcaccca 
aggaacatttgatctagcagcagggatgagaggaaagcagaaatgaatgaactgtgaaagctcctgtttttattatcaaa 
aaggacactgtcaagaaggcgccccctgcccccacccccgtgtcaccctaggcctgataagcgatcagaggaaaggactc 
attcatgtcacgcttccttgagcagaaaagagcactgagagcacttgggacccctggatcagagagcatctgtgtgtcct 
gcagcctcctctgaacttgtggttcattctcaggctggggtggactcagatgccaggaaagggacagcctcccattgtca 
ggcagaagctgcccaaagcctggagaaggacttgtttgccctctttcccccaggaggggctcgacccacccaccctccct 
ctcagaccaaggtggtggctgtgaggagggcagcaaatgctgacaaggatgaaaagcacatggaaaaaaatggacgagga 
gggaaaactctgccaaatggaaaatgaccaaatttaagagggtgggacagtcccctgctcctctcccagagggcactgct 
tggaaattgtgttttccccatttatggtgctctgtattctggcattatgcagcagcctcccagaagctctcttctgcttc 
aaaacctgggatctctggcattaccctattgggatggaccgctggacagcaatgctcgagtttgtgaatttggagagata 
ctcaaaagagctaaaactgcagcattttacctttaaatgcagtgcctagagagagagtattgtctcttccccaacactaa 
ccccactcccatgaagaattgcctggaaagatgttttcaaggaatttgaaccataaaacactatctgatgcacagaacac 
ctctactttgagactcacctctcataaagcttctttttcacattactgttaaagaccagacgttctagaaaagacccctc 
ctctcatgagctcccccatccctgctacagaacacagcacccatggcgcctgcagtggactggccccttaattcccacag 
gcccccccagcaaggccaaagggaggcccctgggtattgtcctcctacaaggaagatcctctttgtttgttcaaaggacc 
agttttcctaggccaaagaagtctcttccccatgttagtcctatgccttgaaatatcatgcaccatgacccacagccatc 
tggttatgtcttatttttttcctaaaagataatgtttatttttaaaaaggaaggaaggagcaagtgaagtttcattctgc 
tccagcggtggggaagccgctgaatccacctgcttctcctttgcaaccgacagcaaacagctttctccggcctcagggca 
gaaaaagggaatggcagggagtaagaggcgctgggctcggagcctgtttccaagaaggaattggttgtcatctggcagtg 
■ttgcgcgtcacaagagagcctgtatataaattaaaatagtcaagacaacactgaccttgcacttgtacataactatacag 
tagtgtccagaatgttcagacattcggagtgtacataaaacagaaaaaatcttcatgtatttttattaaatataacaatg 
tctgagtttcacctaagatgtttttgtgccatatgctggatatccaggttctcgccaggccccgatacatgaataacaaa 
cccaagaaacgcatccccattgtgtgatgtgttcagatgcatctggcaccaattaggtatttcttaaaacaggactcatc 
tgtcagagtgcacatgaaaaatcaggcagggaatcgaaacgacagcgctggaggagactcaggaagcagaggcgtccctg 
ccgctgcccttggccctgcaagcacatcatgaccctttctggcagcctcttggtgctctgggtagtgagggatgaccagt 
cttgtcctgagaaatgtttctcttagtctttaagttcaaagactaacctgtagcaatcagactttccaaaagggggttct 
ccattttttgtagttttgtctaaatttttaatgaccatttcctggaatcagtttattatactgaaaactgggggtgggag 
tagggagctagtttgctgataaatagttcccatttccccgtggagaatttgacataccctggactcctgtgtgcctcctg 
ccatccctgcacacagcctggggagaagcctgtgcctccccgtgtggagagaaggcaaccccagatcccctgagctaacc 
cggaggaaaggcagtcctggacagaagactgtcagcagaaggaaagtactggactacccgtgggtaagtcctgccattca 
agactggagacacctgggaaataaaaagagcagggcactgctggtgggaagaggcattttaccttccagtgcaaatcctg 
ctcctttgatttaatggggtgtactggggccaggggctgattcacttccttgggagatggtggtgttttcatgaacatct 
ttgatccttccatttcatttattcatccatccattcaacaagtatttgctaaacactaacttaagctaatgctagggtag 
tgactgagatgtaaaaatagattttagaattaaaacaaaatccaagtcctcacacccctgtcatcccaggagatctttcc 
ttgtggtggtttctgtgagaattggccatcctgaggacacagccaggacggcagaggcctcctggcctcagggcatgccc 
tgcctaccttctgaaatgtttaccccattgaccaaacttggctccagccattgcggtggtttctagatagccaggcccac 
caagagatattgccccttgatgagagtcaaacaccctgcctacaaggagatgttttgaaatggagaggaaaattggcacc 
tcatcttttaaaggcagtaatggaattgattttcagtaactgaatttgtgcacaaaacattctaaacactagtgaagcct 
gtttcgttgaactaattaattctggctctggaaatgtttttgttttatagttatttacgatttcgtttgtttggattcaa 
gcttagtttgttaatatgtataatttagcatctattacactcatgtaaatatggagtaagtattgtaaactatttcattg 
cggggattgtgggtgttatacatacatttaggactgcaattttttggtatttttttgtattgtaaaataacagctaattt 
aagcaggaacaagagaactaagggaggtctgtgcattttaaacacaaatgtgaagaacttgtatataaacaaaagtaaat 
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actataatacaaacttccttctgaaataaaagtagatctggtaaaaatgtggcttttgttctgagtgtttcattttgatt 

ttgcattgttttgcctatatttacattttggtcattagaaccttaagggaaaaaaaaaagcgtttaccagttttcagact 

gcgtcagagggagcacttgacagttaacggaggtgagggtgaaaacccctggcaggtacctcagacagccctccaggcag 

cttgtgagccacagaccttccacacagcctttgtgtgcctacagggcagctgggcccgtgggggcagggacagatctgag 

cagtgaggtatggacagccgtctcactgctccactaaccacaccactgtctccagtggtgctgttatccctccactttgc 

gctctcctcacaggctcctgaatgtcctctaggttgtctccaaactgctaaatctcaggtgtgctgacaggtctcagccg 

tggccagtgttttcctcggccttgaagttttgaggaagttactcccattggcaaggccttgtttgggggcctcctgtgtg 

tcctgcccccttttatttggctcctgccctcagaatgggtcagagtcatgtatttcagtgcagagtgcagattcccagtg 

ctagcagttccttccagaaggatgaggtcattgaaagaaggaggaaacacaagacggtcactcctgtaggggaagatggc 

agggtactagggccagggcctgaggtgaaggcaggagcctggaggaggcctgcctaccagagctcaggtctggcttaggg 

agggaaatggctggactagtggctgcaaacaagttgagaaggaccttgaaactcttccagggagggttatgagagccacc 

aacagtaacaggatagaaaccctgtgatttggtggtactgtggcagagacaatgctgtgtttaccaaacctactgggccc 

acagctcatcatttcccagcctccctgcagttaggttgggaccaagctaagccatggccaaggaaaggttggaagaagtg 

atgggcaccaattccaaacgtggcacacaaatgctcccacaggggactctcctacggatcacatgttaaagactggaggg 

actgtgggtccctgagtcaccccttagaggccggccgcctgatcagtaagagctgtgctggcctttgcatcactggaaag 

caaactcactctgctgatccactgagatttggggctttgttacactagttagcttttcttattctcactaatatacaaat 

cagcactttgaagtgcagtgcacatgtaaccaaacccttaagtacatggcactggctcagcgtgtggccaggtggcgagc 

agtgataagacaggtgcagctggcaggacaactgaggtccctgccaggcggagcaaaacatctggtaaggccgtttcact 

cgataactcggaaaacaaactccatgtctgctgactttttgacttgaggggaataaacaacaacaacaacaacaaatatg 

tgttgattgttactggctgcttttggcgaggcatcacgagaaaaagtttacttcaagaaagaattagccagtttacaagc 

agaagggaaaaagatggaaagatgttaaccagggccttgcaattctggaaaagccaactgcttctagtctccaaccagta 

agaggtgcaggtgagcaacaaaggccccatgagacttagttgaacacagtgactgggcctcatggtgaccatccaatgaa 

qggtgtggtcttcccactggagcatggaaactgcaaagccaccagcttgagggaggggcacgtggttaaggaagcaaaga 

aaaagctgacatgagaactgtttccaggaacaaaccatgaatgtagtcactgaccctggacctgcctggaaataaataaa 

gatgtagtaaatttaagaaaccattagcaaagccttaaaaaaaaaaaaaaaaaaaaaaaaaaacggtggcaattgaggaa 

gtcttaaatccaccttcaggcaggaaaatggttgtgaaagctgtgtccagcgagggctttacaagctcagatgtgggccc 

aagagcagaactgggcctgagcgaggactccccccagcccaggacagggcccagcaggattgcaaaaacaccgcagagca 

gcaattgctgcatctgcacccgctccttcctgggtaggagggcctaggagggtgtcctgcctgttctcatcactacctgc 

tgggtgggaggtggattaagctgtcttgttaattcaggattgtggacttcaaagagccacctgagggaggctgcatcacc 

tggaggtctgggcggggtgtgacctcgtattgtccccctaggagagggtgcctgtgacccatgtaggagaggaagcacaa 

aactggttttggtgatcagaatagcaaactgggccagatgcttaccacacatttcctcttcctgagtccacagcttcgac 

tgcatggcctgggcccgtggcatctacccagatgccttgggaccagccctcaccaatgatgtgtcatttcaaggccattc 

acagtgacagtgtctcctccactgtctccatcaaccttgaaggcttatggtgaagatggtaatacaacaggattgaagga 

gactgggtccctgaatgactttgtggaacagagcactataggccaaaggtagtagaattaggtatcttgttttatgctgc 

ttaagtgaattaattaattctctagatttattcacctagttattcaacaaatttttattgagcacatcagggctagggca 

ttgtgtcaggatcagagattttgcaatgaacaaaacagaaaactccatgtcctccaggggagttgataataaaatgtcag 

ataagctctgtgaagaaaggtaaaagtagggttaggggtagaaagtgctggggggattgctacttaacatagtgtggcca 

ggggaagccattctgctaaggtgacagctgagcagaacgatgagtggagggggcaggaaggaatgtggaggtctgggaga 

agagcattccaaacagggatggcaggtgcagaggccctgagggggcagcatgcctggtaggcctgctggatatttctgag 

aatgaaaaagtccattcccagttgtttgcgtggagaacctaacccagttataaacacaaagtacttgagagctggtttaa 

tatacagccagctttccagcagtgcaactactgtgtacatcaagggaaaactgaacttcgttttccttaaaacttatcat 

cagctggtcatcattttgacaaattctgtcaacaacagcagtgtcattcctggcatctgtatgggtcacgtctgaacaga 

cacacgccctgcagccctgcaggtaccagctgtataacaagaactcccttccaccctgtgtcctggaaacaagaaagcca 

ttagaccggaagatcccgatggctatctcaaatgtgctggatggagttgccagggcccactggcatgccctgtaagcctt 

tccttccacgtttggttcctgccccttgaagactccatttctgagtttgtgtgtgttttactttctagtgtgtgtcctca 

tcttaatttttctctctctcttctgccttgaactgaaggttcgcttgggtgtggagagacaggcccccagcagagcagct 

tcccgagacatcctccgatccagggcttcccagcagcccggcaaggcagggctgtgcctttctgcttcagctcacaagca 

tgccaggctcactggcaagctgctgtctggttgagggactgctcctaaagccctgcacagcccctgtcctcctggccctc 

tggaaattccacccccgtgtccacatttcatgcaaaaatgagctggttctgtgagcatggcccggcctgactcgcttagt 

gggcggtaagtggtttccacttcaaccttgcacctaatcaccgggctccacaccaggatggacattcatgagccgtgaag 

tttccagtaataaatccacagatgcttccagcacctgccttttcgcatcacctccactcccagccacctgccaggcaaca 

ggtaacagagacccagtcacaggagggcagtgtgggggcaggactgcagtctcccaaagcccatgcacaaaaccgacagc 

gccctggcaggacaaggaggctgacattcagatgtggaggaacaaggcatgacccattcctggtcatgggggccacagct 
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ggactcagccttgaggcttggccagacttaacaccgtgtataaaccaggacctttttaggtagagtaatggaaaccaaac 

tctaatgatcttagacagtgctattagtctcctggagctgccagaacaaattaccacaacttcagtgtcttgaaacaaga 

gaaactgattctcacagttctggagaccagaagtctgaaatgcaggtgttgccagggctgtagtctctggagactctagg 

ggaatctgtgcctacctcctccagcttccagtggctcctgacattccttggcttgtggctgcatcaccccaatccctatc 

tctgtcttcccctggtcttttgctcaaaatgtctgtgtttagtttccctgtagacacctctgcatcactctcataagatg 

cagaggtgcgacatacaggtgttgagagcccacttagataatccaggataagctcctctcaagatctgtaacttggctgg 

gtgcagtgtctcacacctgtaatcccagcactttgggaggccaaggcgggaggttcacttgaggtcaggagttggagaac 

agcctgggcaacatggggagaccctgtctctactaaaaatagccaggcgtggtggcacacacctgtggtcccagctactc 

aggaggctgaggtaggaggattgcttgagcctgggagtttgaggctgcagtgggctatgactgcaccactgaattccagc 

ttgggtgacagagtgagactgtctcaaaaaaaaaaaaaacataaaacataacttaaatcacatctcttgccacagaaagt 

aatactcttttgcctacatataaggtaatatttacaggatccaggggttaggatgtggacatatctttgggaccactgac 

agccatgaagcaatctcataattttcaaataggttctgtcctttttatctttccagtcttttggaaagcatatgcctata 

ttttcaatccacaattctatttttatttgaggtcatttcatttctggtttttattttttattgagacagggtctcactct 

gtcacaggctggaatacactagcacaatcatggctcactgcagccaacttctgggctgaagtgatcctccagcctcagcc 

tcctgagtagctggaactacagacacacatcaccatgcctggctgattcattttttaattttttctagagacaggctcta 

tgttgcccaggctggtctcaaactcctggccccatgcaaacctcccgcttcggcctcccaatgtgctgggattataggag 

taagccgccttacccagcctccagttttattgtgttttgttttgttttgtttgagacagagtcatgctctgtcaccaggc 

tggagtgcagtggcacgatctcggctcactgcaacctctgccttgcgggttcaagcgattctcctgcctcagcctcccga 

gtagctcggattataggcatgcaccaccacgcctggctaaatttttgtatttttagtagagaccgggtttcaccatgttg 

gtgaacacaaagtatttgagagctggtttaatatacagccagctttccagaaatgcaactactgtgttcatcaagggaaa 

actgaacttcgttttccttaaaacttatcaccagctggtcatcattttgacaaattctgtcaacaacagccatgtcattc 

ctggtatctgtattggtcacatctgaacgacacacgccctgcatgcagccctgcaggtactggctgtataacacgcctgg 

caagaattcccttccaccctgtgtcccggaaacaagaaagtgattgtgatccactcacctcagcctcccaaagtgctggg 

attacaggcgtgagccactgcactcggcttcctcccgttttttttttttttcaatgcttatattttactctaattaactg 

agtcaaaaattgagaatagttgaatacactttcatgtaaggcgaatcatttagccgatacttaactctgcatttgggcta 

ccatgccgctgtggttagcggggcaaggtgatgagccctgtctcaacacacacaccccgcctctccccagcccacttaca 

cgcgtccattcccacgcaggtgtgggggccttagaggattccctcttcttcgtaaagtgagaatgggctggactcggctt 

cactgcccaacaactcctttttttcctttgggaaactgtccttcccattccatataaccttcatgggctgagatcactca 

ccgagctacaggaagaggcccattactatggccccatcagggttctgcctgggacagtccataaatgctggaagagagag 

gtgcctttccctactgagg'ctgctaaaaggagacactgcaaactggggctgctggtggcaatcttacactctcagtgaaa 

gcctgcctggctgcaggggaaaccaatgcacagaccagcaggggcaacatcatttgaacccctggagacagctgtgcctg 

aagcccacatggctcaggcacatgctcaagccactctgatttgcttgttacagtcaagagagtccttggccgggcacagt 

ggctcatgcctgtaatcccagcactttgggaggccgaggcaggcggatcacctgaggtcaggagttcaagaccagcctga 

ccaacatggtgaaaccccgtttctactaaaaatacaaaaattagccgggcatggtggcatgtccctgtaatcccagttgt 

tagggaggctgaggtgggagaatcgcttgaacctgggaggtggaggttgcagtgagccaagattgcaccactgcactcca 

ggctagctaacaaagcgaggctctgactcaaaaaaagagagtcctgactgaagctgagaggctggtggacagctgtcagc 

aagcagtgtttgtgagtctgatgtgagctgcgaagtccacacctgatttcagagctggtggctctttttccaatcaggac 

agctccaggcctgagttttggtgtggtctgtacctaaccgctgtgtcttaggtcaggatctctagaagccaagcccaaga 

caggagttctcaaatgccgtgatttactgagggaatgccttcgggggaacctgccagtgaggagagctggaggaggcggg 

caggggcgggagctgagccaggtgggggtccctgctggagtctagcctcagcttgaccccacaggggctctggagcagga 

acagcacactgcattgtcccttgaggcagtcactggctgcagttgcctctgggagcagagtaaaagtggacaggcatttc 

tgggagtacaattccctagagaaggggacagctctgagctgtgttaccagccaccatttccaggggctgggggatgcgct 

gcactggcccagagaggtctctgagcaaggcccccacaacctccacttcaacctcctgccaaacctcaagcagattgaga 

aggagccgctagagacacaggaaccctttcctcatcagaagtctccaagctttggaggaagaaggaggcaggcaggaggc 

gggaggatcgtttcaccccaggggttcagggctgcagtggactatgatcatgccactgcactactgcctggtgacagagt 

gagaccctgtctcaaaaacaaacagacaaaaaaaatcaatgctcttagcatctgctgggtccacattgggcagtgtgtgt 

gacgtggtccagctgcaagggaggctgggaaatgcagtttcattatctgagtactagtggagaaagaagagacgaaagca 

atgagaatccctgccatgaggaagtgagagctgattcatttcagcttcctaaggtgaggtgatggggcccagacaggttc 

tgaggccactggaggtcacacaggccagcagggatacacccagtggtagaccctggtctccaggctcccaggacacagac 

cacataaaagcaaggtcagtctcagcacacagatttaataagcactgtatgttatgcctacagctgcaggaaaccagggg 

gaagatggccacagggccaatgccaggcagggcacagggtggggcccctcaagcagcatatggggaacaggaaagactct 

gccagcatggtcatggtgcacaggtggctacaatggtgggtaatgcccagttgggaggatgcagtgttagacggggtctt 

gtccgatggattccttagggggttatatgcttgggaggctgatacaatttcctagaattctgggattctggatggtctct 
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gacctgctctaggggatctgggtctccagagaggtctcagctacaaaagtgaccctctcctgaccccaccttgtgggaga 
ccccagtggtctcaaggtctgtgaataaagggacttacaaccaagatgcctcctgacagggagacgggggctttacagtc 
aggccccagcctacatcccttgccccaggggacggcacacagggtgtccaaacctacaaagggcagcggagtgaaagcac 
cagggccggctt gggggg taagccagccccttctcaaggggccagtaggccacagaggcggcccaggcaggcagggtgga 
gtagctggctgtgctatccgaggtcgctgtcctaatcagagcagggccgggagagccaggacaggaagtatggagagcag 
ggagcgtctctccagggccctgcctgtggaggacaccttgggggtgggagccaagtgcagacttgaggcaaaagcctgtc 
tctgccctctgcctgcacccccaccccaggcccttcatgcctccacccaggcccaccagggacggaagggtgtgaaggtg 
acagtgggggaggtgccagctcaccctcccctccctgcacctacctgcaggaggctgtgctccgggtctccctgggggtt 
cagctggtccagcaggactgggggccacccgctggcaaaggcctgaatggcaccatctatggagggaggggtccagggtg 
gacttagctgggaaaccctagcacagggctcagcacaggggacaaagtgagagaccccctgctgcaaagactggctcctt 
ggacagaaggtgtgtggtgaggaataacaaaataataagcacagctgtgacagagtgactcaactgtacagtcttccatt 
ctcccctctaaaatggctcccagccaggcggcttgggagggggcagtaagcgccgcccccactccccctccactcccccc 
cgggcccctcacccaagcagcggttcctggtaaagagcccccggaaggcttcgcagtcctcacgccggttcccgctggct 
ccgcagtcgcaccagggcgccacgcgcgcgctcacgttgtccacgtagttaggggtgacggcggtgcctgcggggaccct 
gagggcgaagtcatcggcccacccgggcggagatatcccctggagaacccccgccctcgcccggatcccggccgcgcgta 
cccacgaggcccgcgtaggcgcgcaggcagcgggcgccctggtccagcaggcagccgtcgggggcgctgggcgctggggt 
gcacgagacctgaaaggccaggaggcgaggcctgcgcggcgggagggcggtgagccggagagaggccccgtccccaccct 
cgccacggccccgccgccgcccgcgcgcacctgcagacccggctgcgctcgcagaagtttaagggctcaaggcaggaggg 
cggcgcggggccgggccccgaaaaggcgcaggagggcacgaaggtctggcgccgacgctcggcgcacgcggggcccgcgc 
acgggcagaagagcagtgcgtgggtgagcgcgggcggcccgcgggcgaagaagcggcgcagggcccggcggcagcgggcg 
cggggacagcccccctgcgcagcccggcccaggcactgcgccacatactcggagcgcaaacgctggcaccgcgcgtccgc 
cgtgcaggcttcggccgcgtccacacatcggttccctccgaccgagctcgccgaccctgggaaaggcgcggggaggctgc 
aggtctcagtgcgccccgcatgcacacttcacccagcccatccacgcccctggaaggtgggggacactgaaaacccgaga 
aagcatagagtgtggcccaaaggcggggccgactggcactgatcggctctccgtcgtggggatgggcagcggcgggctct 
ggagggctctgcaacacgtcagggatggagagggagcatggaggctgagccaggccttccccaaaagctgaagcccctcc 
ttcccaagcatcgccaccattcccagaacggccacacacgtccaaagacgggtccaggggctcctcctacactcaatggc 
actacctgacgggggaacaatgaggctttctggggatctggccaatggagactttaaaaccagagatggcatagaagaca 
gtggctgctcctagtctcttaatgacaggctgtgacgccatacctttcctcctctaccccctccacctcgccgcttatcc 
tccaccatggagaccctttgggtgggtctattgatttctgtttctccctccttcctgacctcctgctcctcctcctgccc 
cctgcatgggtgtctcccctctccagcccttcctgatgctcccttgagtcgaggtcatcctgccccatctgcactggctt 
cctccagcccctccacccttcatcagccaggggagagcggatttatgcaagagctctggccactgaagaacccactgagt 
cctggctgccagggtgtggattcagagaagtccctgacgacacaccgccgggctttgtcctgagccacaggacccaaggg 
ttcatcagtggttttagtgaccaggccccatgataggtgacaccctgctggagacagagaaggtgtctggcaccccagtc 
acacaccaggggttgggtagacagagggtgagctctgcccacatgcccacaggctgtgtctaaatagagccaggataact 
ggaagttgggacttggcagccccttcccaggatttgagtcacaaaaaccaggctaatcgaatcccccccagcctccacaa 
gttctccagtctggagcatctcattcagactagaagcctttctggtgcccagcctgagtggacgtgcacagcctggcccc 
tcatccatctccctgtgtgtttgtgcacaggggctacggggtggtatccccccccagggctgggagcagggtgcctctct 
ggcaatggctgtgtctttcaggctgtggattggaccgcgtggtgagagtgattcctgctgtgacttcagcacctgcctct 
gcaggaaggatgaacatagctcgagggcttagaatggtccctgagtggaggggatagatggcagcggggtgagtcctccc 
ttcctcccctccccaaagggcacactttgctttcctctcacagccttgccaggcacatcctcaggtctcagctgcacaca 
ccagcacctgatgttgcctaagggtttgaacccacctctgcctgttgcttacctgtcacccttgcccagaacacccttac 
ccagtctccaccctcacagcagctcatctcagctgcaagatctcctccaccttaaacctttaccagattccactttccac 
actacactcggtaaagtgacccctccctccgcgacatccatgcagggacggagcagtccaagaaggtggaacaacagatg 
aacaacaccgaagattcaccaaggaatctgtgtgatgggtaggagggctggggctgaggggagggtgagagagttgggag 
atggcagacaataggcaaaagttcctggggcacgggtgccaaggctggactcgggagggggatggcggtggtggcttgac 
ggggctgggaggtgaggtgctcagcacatagtggggtcaattgccaggggtccccaagccgtgcgcacaggcctctcact 
ggcaacccttcctgtggttcagcttttcctggaatatcttgccctcccaccttgtcactgaggagcagggacagagaggc 
cagggttcggggtccaggggaagagaggggcgctcacccagtaacagcagcagcagcagcgcaggccccaggcagcggac 
catgctggaccttcaacagagaaggatggctggcagagtcctagtctgataggcgcccccttcctagtgcctctggagca 
ggagcacagtgaaaacccagctgtctgcgctgatacccctgggaggagcctttgctgcagcaactccccgccctcattaa 
gtctccaccccaggctgcaccgtcaggtaaacctgggagccacgggattcggcgcctgaatgtgcctagacggaccccaa 
cacactagccctgccacgcacacccagcaacacacagacaaccacccagggagacctggtctctcccatgctaaactagg 
aacgctatgaaacagcatttttcttttccttttttatataattctattttaaacttaaaaaaattt ttttgaatagagtt 
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ggggtctcgctctattgcgcaggctggtcttgaactcctgggctcaaaccatcctcccacctcggcctctcaaagtgcta 

ggatgacaggtgtgagccaccatgcctggcctattttaaacttcttattgttgactaattttagacttgcagaaaagttg 

gaaaatgtaagcattttccttacacccttcacccagcttcccctgatggtccatcttacttaatcatagtccaatcatta 

cagctaggaaactaaccttggtacaatcctgttaacgaaactgtagactgttttgcatttctcatttttccactaatgtc 

ctttttctgttccaagatccattctaggatcccacatcacagttgtctccttattctcctcaatctgggagttccttcat 

ctttccttatctttacccttgacacttttgaagaatcctggccagttattttgcagaatgtttctcttgagttgtctgat 

gttttattctgatcagaatgagacacagcattgttttgactaaccaaaaagttattctataagtaaatattgaggttaaa 

aatctcatccaacctgggcaacagagtaaggccctgtctcaaataaagtctcaacactaagatttaaaaagtgaccagaa 

aagcccccactatgatttgtcttgacttttttttaaaaaacaaacaaacaaacaaacaaacaaacaaaaacgatgtcttg 

ctctctctctccctcaggctggagtgcagtggcacgatcttggctcactgcaacctccgcctcccaggttcaagcgattc 

ttctgcctcagcctcccaggtagctgggattacagacgcccgccaccatgcccggctaatttttgtatttttagtagaga 

caaggtttcaccatgttggccaggctggtctcgaacttctaacctcaagtgacccgcccacctctgcctcccaaagtgct 

ggaattacagacgtgagcaactgcgcttggcctcattagcatcttaaatctccacacaggggtgtgttccttactgttat 

aaggagcaaaggatcagtttgaggacaggtaaaataaaaatgcgcttgctgcctagagggagaagtccctgctgaagata 

gctttgcttgaatgagctcaattgcaatgccagtgctgaggcttgttgactgtacggtcaccacagttgctgctgcgcgc 

ctagaacatggtcactttcttgactacctatcctgtctcagtacatctgtctgtggtttgtggtggtccatttcctaatt 

tttttaatgaatcagaagactgtgatgtgctttccgctgtgctaaccatggccgctgaagcaaaatgtaaaccaagatgc 

ccctgcagtggttgtgcttcactctacgacatctgttaccggaaaggggtccagattcagaccccaggagagggttcttg 

gatctcgtgcaagaaagaatttgagacgagtccataaagtgaaagcacatttattaagaaagtaaaagaataaaagaatg 

gctactccatagagagcgcagccctgagggctgctggttgcccatttttatggttgtttctggatgatctgctaaacagg 

ggtggattgttcatgtctcccctttttagaccatatatggtaacttcctgatactgccatggcatctgtaacctgtcatg 

gtgctggtgggagtgtagcagtggggaccgaccagaggtcactctcatcaccatcttggtttgggtgggttttagccagc 

ttctatattgcaagctgatttttttggttggtttggtttttgagacggagtctcgctccaggctggagtgcagtgacatg 

atctcagctcactgcaacctccacctcctgggttcaagcgattctcctgcctcagcctcccaagtagctgggattacagg 

cacacaccaccatgcctggctagtttttgtatttttagtagagatgaggtttcaccttgttggccaggctggtctcgaac 

ttctgacctcaggtgatccgcccacctcagcctcccaaagtgctgggattacaggcgtgagctaccgcgcctggcctact 

gcaaactattttatcagcaaggtctttatgacctgtatctcctatctcatcctgtgacgcagaatgctgtaactgtctgg 

aaacgcagcccagtaggtctcagccttattttgctcagcccctattcaggatggagttgctctggttcacaggcctctga 

cacatcctcttgtgttttggcgtgtgggaggaaagaggggtgagggaaggaaactcaaaaccaagctctgaccacacagg 

gcaggtacactctcccacctgtctgtgggtgccacaagtcaagggaggggcagagagagaagaaggtgtgacagatggcc 

gcaggccacagaatgtcagaggaagcccagttcctcccggggcagcccaagtagctggtagttgggtggccaaacagagg 

gcgtcacagctgagctgggctcgctcgctacccccagctcagcgtccactctgcccctcagtacctcctgctcagcctca 

gggtccatgcctaccctcctgcttcccagtcacttctgcgtgcctcctgcttttctgctgttggccccatgccagctcct 

ttctgctgagcttctcttctccagttctgcagcacagccaggtgatcctgggctccagacaggcctctcccccagtctgg . 

ggcctcccctcttagagccctctttccttcccacgtggcctccccagggttcgccactgaatggagaaggggtggagggg 

gtgctgggcagtcttggggattagccaagagggcagagttggcctccccagggtcccttgtagctggagtcccgcggggt 

ctagacaccccctcctgaagggtaagagcgggggaggtatattaacgtgtatttttagagtctctctttttttttttttt 

ttttttgagacggagtcttgctcttgttgcccaggctggagtgcagtggcacgatctcagctcactgcaacctccacctc 

ccatgttcaaacaattctcctgccgcagcctcccgagtagctgggattataggcacgtgccaccacaccgggctaatttt 

tgtatttttagtagagatggggtttcaccacattggccaggctggtctcgaacttctgacctcaaatgatcctcccgcct 

tggcctcccaaagtgctgggattacaggcgtgagccaccatgcctggccttagattctctattttatgatggtttaacat 

ctcggggtgggggcttgttggctggagagaaactgcttgattcctggagatcagaaacaactcatgcctttcatatgcaa 

accgaccagtcttgagtccatacaccaaccacccccttcaaggaactctcacatacgaaaccagtatttcccctgcccta 

aaccagctcagggccaggcacctacccagacaattagagcccaaccccacgccccaaaccccaccagaatgattcaaaat 

gccaatcctaccctcttcccctggcctgccttgcctccccagtggaaacggcactgtgggctgtggcctgtgccttccac 

tcgctcctgattctgtccctggaccaaacctagtgcctccccactgtggccctgcatggtgggaaactgtgagtaataac 

ttatttcaacagcattggcctctgtgtcatcagtcaccttcataaattaaaattttgcaactacaactgaggcaggaggg 

ggctttagagagcactctcacctggccccttgggcttgtggagtggagcccgaggtgaagttgcctccctgcactcagtt 

ttgggatggttttgttatgcggcaacagctggctgatataggccactcagcagctcttgcatgaagcaatgggagaatgt 

gaacgcccaagggaggcaggagtgacagagcaaagagggtgttcaaactaggcaacaccgtcctgtgcccagacacatgc 

ctggggctctggctgccatctaggctcggcctgccagccaccaccccgtcagccaccaccccaccaaccaccaccccgcc 

aaccaccaccccgtgtggtcctcagggcaccccatgggttgaatttacatacagaaaagactaaccaaggtccaagtgta 

atatgtctttttaaaatttattttagagacagggtcttgctctgtcacccaggctggagtgcagtggtgccaccatagct 
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cactgcagtgttgaactcaggctcaagtgatcctcctgcttcagcctcctgagcagctaggcctacaggtacacaccacc 

acgcccagctaattttaaaatttttctgtagatgcggtgtcttgctgtgttgcccaggctggtgtggacctcctggcctc 

tggtgatcctcctaccccagcttcccaaagtgctgggattataggcatgagccaccacgcctatagccaacatgtctttc 

ttttgacttctactttggtatcttttcttaaatggttccctctgtccccccgacacacacagaatgggggagaggctgtc 

agattctgagctccagaacctcaggtgtagcactgggattgggggtgggggctcaggaaccacctaggggagaagacagg 

gtgggaagaaacaggaaggaaggtccccaaaattatgtttgtttgcagaggccagccaggctccaggggagtgtggactc 

agtcgaaccatagggccccaggaccactagcttctggccagcagtcatgccctccacagagctgggtccgtggaaattgc 

atgtaggagacacaccagactcccaggacagagcccttttgggatggccagcactacccagcctccactggtgagggagg 

tcaggggctgtgtgacctttgcttctgggactgatggtttattgagctggagagtgtgcccagcagtgttctccagccct 

caggaacttctagtgtggctctgggttcctggagtgggtgggtcgaagctccactcggggaagaaacttccaagctgcct 

gcaggtgctggaggtccggtgattcactggctctgcccctgcagttcaagttcctggagtggctgtcagtggccacctgt 

ctttaaatctgttcattttaggagctacctctcaccagaggcaggatcttggcatctggacttgatctgctgagaatgag 

gaggatatgttgtcccctaaggactggggccccaggctgcaagctgtgtggcagagagcccatcctcactcagtgaggac 

cagtgatccaggaaaagccacagcttctccctccccagcccaggggcttccagcatcctggtctccatgataaccaagag 

gtcataaactcatttccataataacctgagcccagaaacctgattagggggcagcaaactgaggggtgggagaggtggga 

gggtgggcgatgagagggggaggctttgaatccaggtccctgcctaccttgggggtcaggcgagactgctggcagaggct 

tctcagggtggctgctgggctcatgagagttctcagggtctgggagaaatggtggagggtaaatgttgtgaatatggtca 

gcaggagaccctggggctggggaggggcataggggactcaaggtgactgggtgctgcccatctggaaggaggcaggaggc 

atgagcccttcccttctcccttccctctccacctccccctggtgcctcactcacccaggggccagggctgtccagtggct 

gtggggcccaactccatggggtgaacgccgcccagggggtggtccctgtgtgggccatctttggggctgagcaacgtgat 

aagagtccaggaggttggcacagtgatcctgagtgggttattgcctccccgcagcatggtgtccagcccagggagttctg 

cgtttactgagtttcttggggcacccatctgctccaagtcaccctctcagctcccttcctgctccctcttcaggggagcc 

ttgggatccaggctccaagtgagcctcatgccctcggctggcacctcctctctctagtcctaacatttcctccaggctct 

gacaccacccagcagcctggcactctccagatgctggcatcgctcagcttccaaagaaccttggatgtccgccccttcgg 

cagctatgtctgctctccttgcccctgggtgccctgctgcccttgatgattccaagccatctttgactgtccccatccca 

tcccccaaggccttgtcatttcctgtgatgttccttcaaaacattctcccctgccctgagactcccgcctggggatgaga 

agcagccgccaccctctgcagcgccccctccgtgctgacacgccaggctctggccaccttgctcctctgcccacagaccc 

tcaatcacaactcgctttgtcagggtcctcttagctgccacccggggcccaggtggtgccctgcccctgtctgttatgcc 

ctctgcccccatctctggcccaaatcatgccatctcccttggcttgcccggagcactcccaagaccaggctatgtcagac 

atggccacagagtgcctgccctgcctagggccctggtgcagggtgagtcctaggacagccatgcttagtattatgtgact 

ccccactccgccaccacccaggtcacagagaactgggttaaggcagggccctggcacaggggcagccagcaccgcagctg 

accagtggtatggagtgaaaagatgtgctgggcccagcatttgggaacttcaagggggtgacagaggtgatttgtgcaga 

ggaagtggcaaagggccgaaaactggtgagacagaggctggacaggcctccgggggcagcatggtacagggactgcaatc 

tgagccagggaaaaacagggcgaagtcaagggtgaggcagccagctggtgggagaagcaggagagtggacaagaggagct 

gtactgggaggtagagggccatgccttgcggtgctggtggtggggcagggatccaccccctcgcttgactgggaggccac 

tggaacctcctgttcaaagctacttctttccatggcctctggggctgctctctgcacctgggggcaaggctgagggcctg 

ccccagctcccacagccccagcagagctctgaggaggggaaccgcaggagtaggctcaggaagcaggcgctcggagccta 

cccactgcacgcagggtcccttctgcagccccagctgcatcgctgcagatgggctcctgggagtcggtagcaacaccagg 

ccaggccggcccctgggagcagaggcagcaggacgctgaggagcatggccagcaggaaggtgtcatggtctgcggggatt 

gggggaaggggcgctgagtcctgagcaggtgcaccaccccagctcctgcccacatgccccccactggcatactttcagcc 

tgcacagggccactgtccatgctgccaccaaagcctggcttgtcacagaagggtggagcccagcctggagcacagtggca 

gttatggttgctattgcaaacctgcagagaagagaagaggagggtcacgtaggattaggaaccccaaggtcacccccact 

cctcgggctctcaccccgtggctgtggcaggcagtcaggcagcgctgaagctcctggaaggcattcttcctgcagcgcct 

gctctggcacacctacgggcagtgcaccaggcagtgagggggacactggcctgcgggattcaaacggcaaggaggggtcg 

ggtgggcagagctcaccattctaggtccacactgggtgcctggctctaccaggcccaggccaagcaggtccagctgggca 

ctggggagtgccaaggctccccgacaagtcacttcctggccatctaggtgaacggtagagtccactggcaccatgtgcgg 

tgcgagcaggctgggctttccaccctggcactgcagcttcccacacagggcatccctggggaggaagtagaggggggtca 

acagctgcagtaccccccttccccaaacccactccatagcttctgctccctccactcagctccactccctacctccctgc 

acagggcaggaagtggccctcgctgtcctggccgcagtttccatgagcatctcccgcagagttca.ccacctggaaacagg 

cctcgggagctgggtgggagcctgaggaagcatgggccaggctgggggcagctcggagaggggctgcgctcagcgggcct 

cagtttcccctgcccatcttccccacagtagaaaactggccccaccagaggatggggggcagggtgcaagggtgctcgtg 

tcctctcaccaggcccccagagctgctggcactgctgctccagcgtgggacatgcgccatcccagcagtagccactgccc 

ctggcacagggtgagccgtccagtaggtaaacgtctgggggacagtgggaggaggtgcccgtgcaaaactcagggaggtc 



FIG. 7 



WO 01/78894 



31/157 



PCT/US01/12245 



acagtcacccatggcctggcggcacagcgctccagccggcttcagctgcgcaggtgacgggtggtggggaaggcagagag 
aggccacgtgcagtgagaggtccatgccgagagcgcggctcggagctgggggagccaggcctacccaagcccagcaccca 
acgggggaacctgagggcaccaattaactaaggccaacaggccggctcccaagctccccgaaaccctcaccctgaacctt 
ccatgccctcaccaggcagcgcacgcagcagtccccgtgggcgcactgggcccccgggcgcagcgagcagttgtgagcaa 
agcagcagaggtcgcggcactcctgggaccagaaaggcaagaagggcccaggtgagggcgcagcgccccagacctgagcg 
gagagggcaagtgggggccgggcgagccgacttaacctggccagggccgcagtcacactcctcgcccgcttccacgaagc 
cgttcccgcagagcgccggcggcaccgggagtccggggtccggggcattggagaggcaagcgccgccccccttgcggaag 
aaggcgcgcagctggcggcggctgcaggcgctgaacacgcgcggaaacgggtgcctaccggcacggggagggcattgggc 
atggagggacagtcccccaacccccgcgcttctctgatccccacccctgggcttggctacagccgccagacgcgcagagc 
cc agagaggggaagtaacccgcgcaaagtcacacaacaagcgggacaggggacgatgcggccccaatagtgagcagcccg 
ggacccaaggtggaatcgcgacccgacggtgctcctcccggtgtaggagtaacctcgccaggttactcggaaaataatct 
teat accgttgagaatccactttgcctgagcttcttccctttaagcct cat aaaccaccctgaagcggacactatga tea 
ttatccccattttacagaagaggaaactgagggacgaccaaagaaacgcagcggaggaagtccccaggactagccgcccc 
gccgcagccccgaccccccacccgcgtacccggtggccgcagccatgacgcagcctccggactcggccgcagcctccacg 
cagcagccgtcggggtcgtggctgaggccgaggctgtggccgatctcatgggccatggtggctgcggcgccgatggggag 
ctccgagtggtcctggggggccgtgggagggcggtcactgcggccgtagagcctcctgtctctccctcgcccccgcccgc 
ggggctcaccgtgctcacgcctcccgagctctcggcgcggcacatgccctcgacgggcgccaggcccactgtggcgccct 
ggaaggcgcggcccctgggggcggagcgcggcgtgaccaggcggggccgggaggtgaggccgccccacccgggacccgcg 
tccgggtcagaggcacccacgtgagcagctgcgcggagtcgtggggccgctgcgcccacagcccccggcgccactgcagg 
aaggcccagagcgtggcgttggcgtcctgcgtgacgcggctgcggtcccgctcggtccacacctccaggccggtcagcgc 
cacctgaatgtccagagtcctgagaagctgagggcgaggcggggctgaagccgggacagggcgccccatcgcgccggtgg 
tccttcgtggggcgcccttcctcttccccaaaccccaccagcacctgcctgtcctgccgccgccacccccatcaccgctc 
tctccccgccgcccccaacctggtccacgtagttggcgacttccaggagacgctgtttggtgtggttcaagtttcggtgc 
cgagtcaagaactgggaaggcagaaatcccggtggcttgaggggctgagctggccccatccctgaccccgccaacccctg 
gggtctctcctcaccagggtgtggtctgccacaatgtacagttccaggtacttccgggtcctgcgcgcttctcgcctgcc 
ctgcggaggtgcaaatggggaccctgagtggaagctgctgggcttgagccctgacccccaaccccagctcccagaaggaa 
gtttaacatgttttctggaacttgtttcttcagacttcaataaaaatactgggactcgaggcctgtgaattcctgtctct 
tctgatttggagggctatagatacagcattcccactcccatccgatcgatgcccctgaccctgctctggggaccaccagg 
aaggctggtcatgcccgctttgttcccaggatccctgtggccacaggttcctttccaggtgagcagctgctccatccgaa 
agatctcgtgggttgagaagtccttggagccccggggtggccagggacgcagataatagctggcattcctgctgagggtg 
atcaggccactagggtgcagaggggtaggagcgggtgtgagggagctctttccccatcccaggcccagcctcctctccca 
gage tcacct cat cccagagcaggtgcagaggactacccaggagtcggggaagcccctt act cgcccttggtagtggcaa 
tgatcctagggaggaaggggccagccccaaatctcagccagggctggagcaagaggggcaagagggagggtgtggtaggg 
gctggctccaaccgccccttaggaatgcaaggaggagtaggggtaggaatggtgggggggtacctctggcggtgcatccc 
agagcccatggaagcatctcaccgtgtggttgggggccagcaccactggctgcccatctgggccgtagtgggtttctatg 
tatcctggggccagcagcctgctgagagggggtgttacagggaacactgaattcagcttcctcctgcctcctccaggatg 
tctcccagccttcctccctaaatgctaatggagcagctttatgagtgagacactcacagtgtgtcttagggaagggacag 
gagcaatggtgacttgctcagatcagaaactcttggggctagaggaaggagccttggtgatggcttagttgtgggagatg 
tgaatatgggaagcaccagggaggacgccggggaggagtgggaataggggaagagtttgtggttcccaggggacctgcag 
caggcagcaggatccacaggatcgggaggggaggagtcaggagacactgccgaagaatgggacttggagttggggaaatg 
cggtgacctcccccagttcccctgcctgctgccctcctttgttgggcatctggtcgaccctcttgcccccacctgcccta 
gatccttgaaatattttcctcagacttctagaccccacatacctcccacctgtccttcagtgattgatgctcaccccctg 
cctccagagaaaacagaatcgccacctgcccacgctgcttccaccctccctgccttctccacacccactccggtaatgat 
tccatcttcaggctccatctcaacaggatctttcccacacggatggatcaatcataagtcaatctgtcttctttaaagaa 
aatccttaacccaacctcaccttggcctcattacctccagaccacccgctaatgatggctgcttcccccctcccaggcat 
tccaccacctgccccagctctgccccctacccctgccccacacacacccgccaccctaggaggtaggtgatgtgaccacc 
ccattgaaagggtagggacgtcgggaaaatatggttgggcacagtggaactagagtttgttccctgtccatccgactcca 
cgagggagaataaaatacgtgtcaagtgctcagaacagcgcctgcggtcaagcactcagtaggtgatatatactgataac 
ataatctgggtggttttaagagcctgcgctccagcccggacacccaccccacccagcccaaggcaccttcctgagcacag 
gtctgcctgtcccttccccagctcaaaatctttggctctggatggtccaggacactgagcaccaaaatggccttctgtga 
tctggccctcctgggactctccaaattcattcccccctgctcccctccctgtagatggagaccttccaggcaggtcagac 
aaactgctgcccccaagtgtagccactgcatctttttcttttttcttttctttctttttttttttctttttctttttttt 
ctctttttttgagacggagtctcactctttgcccaggccggagtggtgcagtggtgtgatcttggctcaccacaacctct 
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acctccggggttcaagcgattctcccgtctcagccccctaagtagctgggattacaggcgccgccaccacgcctggctaa 

ttttttgtatttttagtagagacggggtttcaccatgttggcaaggctggtctcaaactcctgacctcaggtgagccgcc 

cgcctcggcctcccaaagccaccgcatcttggtccctgccattcccttagcctggggtgccggctcatcttttccctcta 

ggatttctttagactcagcatatcttgcaaatgtccactaggtggtgctcactcatcgccagcagggagctaacaagccg 

ctcctggggttgggagggcggaggtgccccacagcggggctgacagcctcagcggtcctcttcagcctccagggagccaa 

ccacaggcctgcgtgactctccctgtcatctgcaccctctctggggtcctctgcccatccagccacccgcacagatctgt 

gtcagtccctgccccccaacactgatcccctcctcccagccctaccccagcctggcactcactggttcttctccagctca 

agcaggagctcctggccttcagcctccagggccaccagccccatgtctggcttcgagacctgggcaagaaagagtgtgga 

gctgagatggtggcctccaggcctcctgcctgccagggagtaggtggcctgtggagccggctggggaggaagttcttggg 

gagaacgtgggctggggagtcagcaggaccccccacatactatggagggcgtggaggaggtgagaacatacaaagatgtt 

cccaaactcaggatgtttgcagtcctgacaacagccacttggaagggcgttggcacagcctgccaggcacaccagcatcc 

tccctagagaccagaggtcccagaaaggtgcccctcccctggccccgccctcttctttcatgcccagaaggggcatcaaa 

agcaggggaagacagaggggtgctgaggacattatgggggcatcgggtagccatggtcagggcctcctcagagcctctgc 

tacctgaggcttggttccaaatgagctgctgctcatttcctatagaattcaaatttgactcctccacttccaattttggc 

aaactgctccctcttccaaagttttcctgggcctccagcagcccccgtccctccggctccgacacctgcttcactggacc 

cacgaagtaaacatggacgccattccagccaagagagcacactggctctcagctaggtgtcaggaggctggcttggacgg 

ccagccctctctccttcccccaccctcttggcgtctcccaccctgtgggaacaccccacttcccccttgtccactcagcc 

tggctgggggcccagagttggagccggcccaggagcttcctgggaggctgctgcgccttcggaatgtttaacccccgact 

ccttttctccaaaaatgcactggcctggggccctgtccaagggtctcagagtctttggagggagttcttccttcgcaagt 

ggggagcagatggtccttgcctccctggccacaggccccacaaggcctccagcatgagctcatgaggctggaatgccact 

tgctttattggggaaaggtctgcaccgggaaaaaggccatactcgaggtccctgttcctctgcagcccctgctatcttta 

ctcttgccctcctggtaccctgccccttgatatatacccctcatcttgaaatgtgagtgtttcctgccttttggagggga 

tacctagcctctactctttcttctgtaccatcttggcaggcttcctgggggcaggggcccaccggtgggggaagcagagc 

cectttggggctctcctcttggtcacagcccaggccagacagacagggaggcccagaggcagagtgaccccagtgtgtgt 

ccagccttcccctcctggggatggggagggcaatctcaaagctcaggccagtgccgtgcttgaccagtggaatgggggcc 

ttatgggcctaggggatcccagtgagggccctgggttgggagctgctgggtctctgggggcctctcagccttcatggcaa 

tgctcccctgccttccctcttgctggatttggacagtagggctgaaaattccaaacaaagagggctctctaggaggggca 

ggggtgtagccaatggtttaaaatcgttcagaccttagtgggtctcaggctcccagcctaaagagctgtgtgaccatgga 

caatttccccaagctctctgggcttccgtttgcccctctgtaaaatgagcatatcaaggctactgccctcttagtttgca 

gcacagatattatggcacaaacagatggggcatggttattctggaagcgtgtgaagagcgggattgggaagaggctgggg 

cagagcgtcctgcagaagaagcacatggggtggtcttacatctgggggacatcaggagagtgaccactgccccccccata 

ccagaagtggattccacaggagccagtgaggctgaaggttcaggccttcgtggcagggccctgagagggacagcagtgtg 

tccacagggtcacatgttctggtcaactttgcaaaaggttttctttttggtgcttttttttttttttttttttttagagg 

ctcctgaaaagcttcaggacccacaaactctggacccatttctgcctggtgggggtgggggtggcccagatcatccaggg 

agggagggaaagagggaggtggggtggagaaagctgaaatgacttccatgtgtgcgggctcacgagatccagatgtccaa 

accccagtgccttcttctgcccacttgaggggcaggggaggcaggggcctataggagtagtgacttggtggttctgggga 

ccccagcaaaactagaagctgtaatgtagggagagacaaaagggctgggaggttcagggcccctgtggagggcggggaga 

catggcactgaccggctcctccaggctgacggtgcgccagggttgtccatccaggacccagtgcggggtgactggctgcc 

cagggatatgtcctggagtaaagacagagcacagggtgagggggacctgaggaacacaggggcatgggacaaagcagagg 

gaggggggtagaggacatccccagggaggcactggaggccttttggggcagacttcaccttcaacacgcgtgggctcagc 

ctggagaaggagggacgcccgtgggcatccttggatctgaggagctatcaaggaggaggaaaagagaaggctggaaaggg 

acagctcagctggggacacgggagtcccctgacctttgtcggggggcaggcttgggctcggcgatcacaaggaagaggcc 

aaggccgccagtgcagaggggaaggaaagagcgcggcagccttagggatttttagatgggcagcagatgcctttagggtg 

agagatgtacgaagagaggacacttgtgccccccccatcatctgagaaaaacaacagccagatgttgccttgcgaggtcc 

accttgcccagagctccctcggggactctgtcctggtggcagggttttggtaccctggcccagaaggcccctcctcatct 

cttcaaggggaggggacgcttccggacggagccttggtgctccctggccgggtgtgcctaagggggctctaggaggaatc 

ccagagccaagcattactcagagggcgcctggaatgttcccctggaatgctcccagcccctccactggccccaaccactc 

tcacaggcccgccctgcaggagccaggccccaggcacccagagcctgcagcagccctccttcccccggtacccagtccca 

gctcccagaacagacagcctcccccctccacgcagccctggcctcagtcctgctgggctgatggctgcctgtggaagtga 

ctcagctcctgctaggccaccccaactccttttttctcctccaccttctctcccagactacaaacatcaaagacccttcc 

tccaagaagccctccttgattggatgagtgaattgccatcaggcagatgagggccgagaggagtctgccaccttggaaag 

gaggctagaggggccagtgcagggaggg etc tgagtggatgtgggggaggggaaggaggggaggtctct cage ccagaga 

gcacttaactgagagtagagaaccaagctttgctgctcctaggcctctaagggtttggggaagaggtagggtgggcccgg 
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gcacaggtgtggtgtgggtgcagtgtggtgtgtgggtgctgtccacatggccttgcgcgcacgtgctggccacgggcacc 

ctgaccccaatgagggagagaggggcagagctggagctggagctggagctccggtgaccgggtgaatgggggtggaaccc 

gagggagccaggctggtattgggcacatagacgcccctctcccaggggtcccatcacctcccctgaccccaggatagggc 

tcagaggggagggagcagtggaccgcctggggccctcccctggggccagaacagaccaggcccctgtacctgtttggtcc 

ccacacagtgctgtggaagccaccgcccagtctgcatagcacagcccagccccgcatgccccctccctggttgccctccc 

tgttcccggccaggcacttgctgtgcaggactggctaatcctcccacccgcttgcagaggttgttccagccccatcttaa 

catctttgtttggaggggttaccccgaggagacagctgcagtctttccagagcactgctaaacagacaccttctatctgg 

agaggcccttctctatctcaccaaacaaggcaacaatataaacaacatacacactgccctgctgccctgggaggagggac 

gaggggtgagcagggtggaggccacagctagttctgcagcctgagagcaaagcagggactctgggggactcttgggcatg 

ggggcttcctagaggatggagccccgctgagtcctaaggggtggaggagcaggagcgggtcacacggtggcctgcggatg 

gaagctggttgtgagagcgagaatccaggcagagggggctacggctatgggctgggggctgggggctgggctgtccccga 

gggggagggagccatgccctctgctttgccagcggagtggcagccgggcagtgtgggcaagtccgggcccggggccagcc 

caagcacacttgagcgtccctgggcaggtcccacggagacccccccaaagagtccccacgccctgacctactggccgtat 

ggtgccggggccgtgagaccctccgcgcgctgacccgagctctgagcagaacccatccccgccaccaccaccgcgcctag 

cctgcccctcagggcgcaccccgcccgcgtcctcaccttgaagcaccccggcgcctggcactggccagagcagcagcagt 

agtagcagcagcagcaacggggtcccccgagctctccggggcctccagcccatagctgtgagctcctcggcctctaggca 

gcggctcgcaactccggctccgcccaggctggattgcggccgacccgtgcccggtgcagcctcaggccgccgccttcgga 

ccttcccgcccccacctcccaccgcccgccctcgctcccgcctcccctccccgccaaccccgctcggagcctggccaggg 

gccccgacggcgcgcgccatgggggagccgggtcgccactcccggaccgccgcccctcgagggggtggagctgggcggag 

gagggaatccgtgcggcccctcggatgaccggcccgagccgtccctccccgtcggtctcagagggcctctactcctgaga 

ggaggagagaaccgctgggaaggttcttggaggaccgcggcgtggtgggatgaggcggtgggcaaaggccgcctctcgct 

gctgaagttggccccaggagcgcgatcttccgtggtctcctggggccgatctctgtcccctccttgctacccgtcctgcc 

ccgagggtgccctggcggaggttgagtcgggtcatccacctgcactgggtgcccccaaggataggaaggttcaggcaacc 

ggctgccgctgtcttgggggcttcattgctgggcaaaggcgatgcagcagacggagacaacctttcttccctggcggtgg 

ccagagggcagaattgcataaaagctgcagactcccaggcctgggagaccctttcggcctcagtaacatctgtttcatgt 

tttaaacttttgttttcctactcggtgcaaatttggatgagatgttaactttttttttttttttttttttgagatggagt 

ctccctctgtcgccaggctggagtgcagcggcgcgatcttggctcactgcaacctccgactccctggttcaagcgattct 

cctgcctcagcctcccgagtagctgggactacaggcgcgcgctaccacccccagctaacttttgtatttttagcagagac 

gaggtttcaccattttggccaggatggtctcaatctcctgatctcgtgatccacccgcctcggcctctcaaagcgctggg 

attacaggcatgagccaccgcgcccggccggagatgttaacttttaagcaaatctttttttttttttttttttttgagac 

agagtttctctcttgttacccagactggagtgcaatggcatgatctgggctcactgcaacctctgcctcccagattcaag 

tgattcttctgcctcagcctcccgagtagctggcattacaggcattcgccaccacgcctggctaattttgtatttttagt 

agagatggggtttctccatgttggtcaggctggtctcgaactcccgacctcaggtgatctgcccgcctcggcctcccaaa 

gcgctggaattacaggcgtgagacaccgcacccagcctacttttaagtaaatctatttgtttttgagaatttggaatgta 

gtaatttggttagtgaaagttcgagcagtgagagaaacctacattcacatatctcaaaatcaaaaagtacagaaagcata 

gggaaaagtctccgtgctcttagccctcctcaccaacaggaaaccaatatgattagtttctttcataggcttttagatta 

ttttttcacactcaagacaatacagacatatttttttctcttattaacgtttttctgcactttgattttctttttttttt 

tggtcgcttaatacaccttagatatcagtgcgtttagagggtccttgttgttcttatgattattatttagagacagggtc 

tcactctgtcacccacgctagaggacagtggcctgatcatgcctcattgcagccttgaaatcctgggctcaaggtatcct 

cccacctcagcctcctgagtagctggaactacaggcacacggcaccaggcccagctaaaattttt:aattt.ttctgtagac 

agggggtctcactttgtttcccaggctggtctcaaactcctggtcttggccaggcgcagtgtctcatgcctgtaatccca 

gcactttgggaggccgaggcgggcagatcactggaggtcaggagttcaagaccagtctggccaacatggtgaaaccccat 

ctctactaaaaatacaaaaattagccgggcatggtggtgagcgcctgtagttccagctacttgggaggctgaggcaggaa 

aatcgcttgaactcagaaggtggaggttgcagcgagccgagatcatgccattgcactccagcc.tgggcaacaagagcgaa 

actccgtctcaaaaaataaaaataaaaataaaaagaactcctgatcttaagtgatcctcctgcctcagcttctcaaatcg 

ctggaattacaggagtgagtcaccacagctgtccagctacgagattattacttattattactactttggattttcaaatc 

aacttcattaaggtataatttacacacaataaaatgcacttattttaagtggccagtaagatgagtttcgataagtgtat 

ataactacataagcatcactataatgcagacacattccctcactcacagaaagagccctgtgcccttccagcicaaacttc 

cccactcccaaccccagacagccactgatctgttgttctctgtctatagataagttttgcctgttctagaatttcatata 

aatggaatcatgcggcatgcactcttctgtgtctggcttccttccctctttccgatgtttttgagattcatttacactat 

tttgcatatcaatagtttgttccttcgtattgctgaatagtgttcggtggtttgagggaaccacagtttctctactcacc 

agtgcaccatagggttattttccagttaggggctcttataattggaactatatttgcacagagagagagagaggaagaaa 

gagggagagagatatttattatagcaattggctcacgtgattatggaggccaaaaagttcccgaatctgccatctgcaag 
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ctggagaacgaggaaagccagtggtgtgattcagtttgagttcaaaggcctgagaaccaggagcaccagtatggaggtgg 

ctcgagctcagaacaagttggggacaggaaagcagagcagcaccccagagcagcccctcagcgacacctcttcagtaaag 

caaggctgaacacagaggggctggcttcagtgtggatgtcaggtacagaaggcagctcgaggagctactctggcgttctt 

gcttactggtattcttacctcgaactggccaactcctacttaaactgcaggccatggctttaatgtcctgtcattcagag 

gctgtcccttacccaaagccaggttagcatcccctgactgacacttctccctgcaacacgtttcagaaggccctgtagtc 

gtccacttccctgtctctctccccaagctcctgagctccatgtggtctgggaatatgtgtgttgctcacttcctagcaca 

gtcagtgctaataactgactgtagaggggacacagtcgaaaagccacatggggatcagagtcatccttacacagttgaca 

cctcccaaacccagatgagctgtgtccaagtgcaggtcagaggaattttctgccgaagtctctgagaaagggtttattta 

cattttgaggttgcaggggaggagatgaggccatcaaaccaaagctgaggaagagggatcctaggatgcaccgagcagct 

ccgggggcgcctgacagcacctgggaaagatggcttctccactggcttgttggcgtcaccctccagaggggcatcaggaa 

atgtcctgggaaccaggcaaaccagtgagcattaacccttagaagtgcttggcatgggtgacacccaccatctgtaaaca 

cgacttctcccaaggagtgacgcagaacaggatgtctgagggaggcactccgactccagccttcagagatcgccagggtg 

gcacctggtgacgacaggctgatgcttgggtgccccagaaaaggtcatgtgtgtgaatgggggccccaaagccaacgctt 

catccctgacagcctggtgcatttagaggggaactttttgtcccttggcaaggtgggtggaatttcaggttcatagggca 

agggtattttagctttaatagatattgtcaaacagttttccaaagtcattgtacacactctgtgattctacttatgtaaa 

gtttaaaaacaggcaaatcaaatctatggtgttcgaagtcaagacagtagttacccttgtgggggctgcaactggtacag 

agtgtaaggggggactgtaggatggtctatttcttgatctgggtgtgttcgcttttggaaaagtccttgagttgcattta 

taatgtgtgaacttttctgtatgttacactttaattgaatgtacaaaaagtctcaggaggcctcagaccactggaagcgg 

acacaactaacccctctgagagcctccaatccaagatggacatatgtccccttggaagtatgcagaagcaggtgaagact 

cctaagccggatattcccaaatccccccagtagccgcagcttcagcagctgcttatggtcctccctacaccctctcttcc 

ccagacagcccccaaacatctggctgcatttgacttgctctctccctgtcccacctctggatttagtccatgttctccac 

cctccccactgtcagcaatgtagacaagacaaacgcttagttcacgtgcccacctactgcgtgccatgcacggggctggt 

cattgtgggtggcaaatgtgagcaacacacgaagcctcaaggagcagaaagggacacaaatcacttcagcgtaaggtaat 

ttgtgataaatgtcatgtaacttgcagcccctggccccctcctacagatggtgtctaagaataaaccccactaacatgtg 

actcctctgttctagcccagctgtttgggttgcaagaaagagactcactccagttgcgtcataggatggagttttattgg 

gaggacattctggacgggctcccagcaagagtctggcaatggagcatgaaaatgaatgagcctggaatgagggagggtgc 

ggcctcggctacacaaagttcacccggcccccaactgcctcccagcgtgttagctcctgtggcaactccccacttctctc 

tctgtttttcaacccaaatcctagagcagagggctcttcgctcctcacaccatctccaccaggctgcagggggagtcact 

agcccactcaagagctctctcctgttggtccctgattcgcaggggcacgtcattccttcttggtaactcattctctttaa 

cagcccgacacagggtctccaggaaaaggacaggagctcacacagtcatgaacagagatgcatctggagcaggaataagg 

atgctgacaacatctgtgtctgccctccactcttgtaagtgccagtaaggtagtacccagtccctgggggtagggggggt 

ggtggtgggaatgagggcatctcttctttcaggggccaaatctggcacaaggatgtctgcttaggcaactccactgcctg 

gcatgctctctccctaggaaaacaccaaaccttttttatttcctcagtcttagtgtgagagtgatcacctcttccaggaa 

gcccacaaccagaatggccaggggcacaccctggccctcagtagatgggcactgagacaaaacatggcctgctggtggca 

ttcccaacaatgtcaaaagtctcagggatgagctcacagttgggagtcctttggaagtccaattccaaatgtcctctgga 

ctcaaaataaaaactacatttcccagtctcccttgtagcttgcagtagccatgtgacaatactccagccaatgggaagtg 

agtggaaaagtcctgtgcagcttctgagttgcgtccaccatgggcatgggtgtgcttctacctccactcttccttcttct 

ggctggagggcatgtggacaaagcggggccatggtgagcctcacaatcaaaggcatcattttagggatagagaaaaggaa 

gatagaaggatcctgagccccaacatcataaaccatcgtaacagccagactagcgtgggagagaaaaataaaattctatc 

atgttgaagtcactgtatttggtctcttgttagagcaggctgactgatactctgttgaatacagaatgccttggtgagcc 

tctgaggatgcaggatgctgcaatccaaccaaagtccagggtgactttcgaggagaaggatgtgaaagggcagggcttcc 

tttgagggagaaatggagagaagggaggagatccagagaagagagagcgagatcagctcagtgctcttcagtgtcctcca 

gggtcaacaccctcctgggcagacctctcaccacccatttttgggggggcaagaggcttggggccaggtcataaaaagtc 

agatgtcacagagctgttttcgtagaggcgggcaggactcaacactgcctcattcacattcataggctggagtcatcaca 

gagtctggccactttctcctccggagggcaggcaacaccactggtcagcccaggggtggggcacaggttgaggtctcaca 

tgtggctgcatcaggatc.aatgagcttcccacagagaaaaccctggtcacaggag'gcctctgggagccagggcaggcccc 

caggcccccagcctcaggtggggtcccaagaggattggtgggagtgtgcatggctggaagtgtgacttggagccctggcc 

taccaggtgaatcagctgcagaagggacacaccacaggggaggagaaactgactttcacttctgcccaggtgacgggcag 

gcttgggagtggggaagaggcctcagaacaaaggtgggggagtcagaacatacgtgtcttggaaatggaggggatgctgc 

caagtccagaaagtctcccctagtcctctaagccagccgccagctgacacagggattccctctgccccgagcagaacagg 

gctctcctatctcctgaggggctcctggttccctgcacactctccccaacctccccttggtcccaggcaccatcctctaa 

atgacagcagtctccagatgccagttggcactgtgggttagctgggtattatatctgcccgtgtgtgttgaatggatacc 

tgcgtggtctctttctgaaaagccatctccaccgaattctcgcccatgctctgcttacaaacacgcctccttctgcaagg 
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cccggagtggactcagagagcctctcccctccccacctcctgcccccattcctctcccgctccccacaggctggctccag 

accacaagagcgtgggaacctcatggagagtgggtgcttcctctgtcttcctccctagcccttgctttagcacagccagg 

gcagagaggaggaaagaaggaaactacagcaaggaggatttagggacaattctagaagggatttccaatttggaggggca 

gtctgggaagtaggcagcctgtgaacttaataggaggtcctaaaggacctgggggaatttggggagaagacgaggtgggc 

tgccttaccccaatcttctggcccactcccagctccagacccacctccaggtgtagcaggcccccaggcccaacagcagg 

agcaggaggcccaccagcagtcccaggacccagagcagctgctggaactgatgcaggcggtgcagggctggaacacagag 

cgggactcagagcagccacagctgcaggcccccagtgcttccttccaaggggacccacccaagatgacaccttctaactt 

ttgctttatcgctgaagctctccacacaagggtgccccgagcaatcagtttagagtcgacaggcaaatcatgctgcagca 

ggagggatacagggaaggaagcctttgggcctaagagctgggcttttgtgcagagtggcctgggagatggagccccctcc 

cctcacctctgaccccaaagtaggtggaggtagaggcagagcccaggacatttgaggcagaacagatgtattccccttgg 

tcgctgggcctcagcgcctcgatgtccaccctcagggcattgggggaagccaggacatggatgtgtggctctgcaggagc 

accctggggctgactggaggccaccagtcgactgccaaggtggagagtcaggctggcgagcggctcgctgtccactcggc 

aatccaggatgccccggaggccaccctcaggctccacgaagaccatcatggtgggcgtcttgggagggtctgtggggagg 

aagggagtgtggtgattgcagaaaatgaccacaaatttctccctccctgtacccatgcacagtgactttgctgctcctcc 

cattaagaggcagaacctatttcctcactccttgaatctgtgactcagtttgaccaattgaagaaagaagtgatgttgtg 

caacttcagagccagacctcaagaggccttgtagcttctgctctacttgttggaacacaacaatgtgggaagcccaggcc 

agcctgctgggcacatgtggcccaactggcagccaggagcgtgcagtcatctgagaccactgggcaactgcagccacatg 

aggtgtgaacaacagaagaactccccagctgagcccagcccacagaattgagcagataacatgacttttgtttgaagcag 

taagttttgttgtggtcaatagcaatagctgactgatacagtgtaggccttggtgaaggctggagtgggagaccaagact 

gatgagggcttatcgtgtgccagacacactcagcacgctttatttcttttggttttttctcttcttttcttttctttttt 

ttttttttttttttgagacagagtctcagtctgtcacccaggctggagtgcgttgtggtgatcttggctcactgcaacct 

ccacctcccaggctcaagtgagtctcctgcctcagcctcctgaatagctgggattacaggtgtgtgccttcatgccgtgc 

taattcttgtatttttagtagagacggagtttcgccatattgaccaggctagtcttgaactcctgacctcaagtgatccg 

cccacctcagcctctcaaagtgctgggattacaggcataagcccctgcaccagccatttatttcttatacaacttgcagg 

atgaggcgaatgatgtattccccattttacaaggtcacccaaccagacagcgggagagccaggattccaacccaggactc 

aatatccgtgaagccttcactcttggcagacagaagaggaggaagaactgaggtgggcgtgtgagatgtggggagactaa 

ggcggaggagggggttcacactcacagagcacacggag'catgactggagcagaggcagcagccccagtggggagctcagc 

caggcagtggtacatcccagcttgagcacgagccacgtgggtgaaggcgagagtgggcacaggctccgcgtgcagccgcc 

ggtcattccagaaccatgcaaaggtggagttgcccacaggcccagggccacccaggaggcggcagctgaggttcagggca 

gcgccctcaggcacgtccaggccaggctctgccaccacgcgtgcacctgcgggcggaggatagagagatgattggggatc 

tgtaggccttgggggctggagcctctgggtgggacctctgaggggcctctgcacattgtgggaaggtctcagtctgactt 

ggtatagggctttccaaggtggaaccatgccccctccagtgggagctgtgccccctccaccagattaggcttctcaaggc 

agagctttctcagaccagacagcccgctctagtgggagctccagcccctctcgtggacttggacctgagagcatcagagc 

ccctcctcctcccctctcttccactcaccttctacctgcaaccgcccgatggtgctgattgagcccagcaagttttgggc 

tgtgcaaacataggtgtcatcacctgcaggcacatcttgcacctgcagccgtagggcgtttcgggccacctggacatggc 

ctgtccctgatgccaagctgtggaccccgctgctcgtggccagcaccttgccatcatgagatagggccagctcagcaggt 

ggctcactgtccacagtgcactgtatcacagccatggatctggccctggagtcccggaaggaggacaggacagcgtcctg 

aggggcatctgtgggcaggcagggcacagatggggaccttgcttaggcaccctgtactgctcattgcctagctgcctggg 

gttggtcaggctgaggcctctggaccaatggccacttcctagaagtgacaccttcccaggagtcctgtgggcagagtgct 

ccaggcagggcttataggactgtctctttctctagcacctatgtcctctttccccaactgcccattcctgggcccactgc 

agtcctttgcccactcccaccagagcccagaagcaccctccaccaccctcctcctgggcagccctggccaaggaccctct 

gctcacagaggacttgcagggcagcaggacgggagctgcgggtgccctgggcatcctgggcctggcaagagtaggcgcct 

gcatgagcccgcgtggccaccaggaatgagagtgaggcagctggaccctcctgcagccaacgaccgttgtggtaccaagt 

atagagtgtgggtgcgtgggcagcagggtccgcacaggtcactgtgatgggggcaccttcaggcacggcagcctccggag 

ccaggatcacccgcacacctgtgccaaggaggccaggatcagccgggcccagccggacaccacagtgggcttccatccca 

gtgggcttggcctaggccctgccttaccctccagccgcagctccagggacgtgttggcctggcccagagggctgcgggca 

gagcagctgtagaaaccctcatccctgggctgtggccctcgcagctccaggcgcagggtgttggggacagaggctgctgt 

cgaggaggccaagaggcgaccggcgtggctgagggccagctgggcgggcgggcggctgtccacagtgcacagtaccaggg 

ccagctgcccgccatggctctccaggaggtaggtcaggcgcaggttgcggggcgcgtctgcagggcatgagaggcttaca 

cggagtcacaggcggcagcctgcaggaggccagtttgcaggtggcattcaaactcaggagggccctggctccccacccct 

atggctcccaaccacccagtctgaggcactgctcctctgcccagacaggactcaccccagcgccccctactcttaatgct 

ccttgcccagtgctccactagctactgggtaccgagttccccactggacacctgtcgggttccggccatgccgtgatccc 

tgtgggcttctgtcattcctgtgagtcccaccctgcatccagccagactcacagaggacgtccaaggtgataggtctgga 
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gaggcggggtgcccgaccaggggggcccacaccgcagcggtaggaggtggcatccctgactgtgacgttgggcaggggga 

tggagtgggcatccaggcgctgctgcccatcctggtaccatgtgtaggtgagctgggccgggtgagtggtccacacaagg 

caggtcaggttcaccagctgcccctcccgcacggtagccccgggccacacctgcacattcacagctggggagagggaggg 

cacaggacaccagtgagggtcttgaggctgtgtacagcaggccacctgtctccatgtgggtgagctactcctactgctgg 

ggagccccttggcctttgggagccttgggtggcccacctataaatgtggcttagaactcaagcttgggaatagcccactg 

gctcctgagtggtcccaagtagagttccacagagtcctggaaagccctgagcccctccctgaccctggctgggggcaaat 

ggggagaccacagacctcccctcccctgctgcactttctcctgaaattccctgtttagttttatccactgggcatacttg 

taaaatttggtgtttataaaaggttctcctaagaaactcaagtctggagtccaccatcgttaggaagggtgggaatgggg 

actattcccgtgcccccaggcttctagaaccctgtcccacctcctctcctcttttaaaggacacacacacacacacacac 

acacacacacacacacacacacactgaccttgagcgtcgaagtcagctgaggccgaggcctggcccagggtgttggaggc 

ctcacagatatagacaccctcatcctccagcatagccccgtggatctccagacgcagtgtgttgggggccacagccacat 

gcagcctgggagagctgccttcgggtccccccacaccttgtagggtggaggccacaaggcgatccccgtggagcagccgc 

agctgggccggagggtcactgtccacacggcacaggaggaggcccagtcgtccaggacctgtgtccatcagggtagtgag 

tgtgacgtggcgtggggcatctacacagggtggggagtggtcaggacccagccaggggggtccctcatccagggctctgt 

catgtgacacagcccaggactgggggactgcattccttcccccacacctggatgggacaaaagtccttacaggacacgtg 

gaggctgatgggtgcagctaggctcgtggtggctgagcctggggcctgggcttggcaatgataggccccagcttgtgtca 

aagttatggctgcaaagcggagcgtggccgaggtcgactcctggaggggctggccatcccgataccaacgatatgaggtc 

ccctctgggactcctgtgggtacctggcagctcaggaccacagcctggccctcttggagctcaggtgatggtgacacctg 

gacccaggctcctgcaggggaaaaccaagagcaggtgagggctctccaccacacctctcacagtctgggaccatgtgcgt 

gtccacctaggactacacaacccaaccccatgtgctggagccgcagccccttccagaccaccgccagggcccactcagcc 

ctatgccttggcccctcctgctccccactctcctgcacaggctccatcctggcatttacctcccaccacctggtgggatg 

gttttactgtgtcagtcatatctcgcctaccaaag.cgtaagccttatgggcactggacttgtgtgacctgtaaccccagt 

tccaggcactgtgcctgcatgtgttagctcctgacaaatgttggttcaacccataaagtactgaaaagggagggatttag 

atcatcccagggccattgtcctaactaggatctggactcagcatgggtgaccgagggcttggaagggaatgttagcagct 

atgttatcttgagcgctctcgccaggccagccctgcatccagactccaggccacaagagcacaggacctggggaccaggc 

aaagggcagctcctcagtggcccctgggtgtgaagcaggagaaccccaggttgcagggagtgagatggagggacagctgg 

aatgctaagcaagaacacagaccatgcctgagaccacagctagcagggtcacgcaactggggaaaagctcctaaaactca 

aaccttcaatttcttcctgcgaaataggtatgctaagtaagaaaagatacacagaaccaaggctcaaagtaaatgtccca 

aaattggtgcfcgtcggtcaccgtggtggaagaaacactcatgttcctttgaccttcgcattatgggcattatctttcctt 

tttcctggtttatttgtttagggagaattttttttttttaagacagagtctcactctgtcatccaggctagagtgaagta 

caattttggctcactgcaacctccatctcccaggttcaaacgattcgcctgccttagcctcccgagtaactgggattaca 

aatgcccaccaccacacctagctaatttttgtatttttagtagagtgttggccaggctggtcttgaactcctgacctcaa 

gtgatctgcccgcctcagcctcccaaagtgctgggattacaggcatgagccatggtgcccggcctgagggaggattttta 

tagagactatatatatatacacacacacacacacacacacacacacacacacacacacacacgtatgtgtatataaataa 

agaatatgaatatattatgcataagaatacacataatgtatatttgtatgtgtatatactcatacataaacatatatgca 

taatataaatatacatatgtatgagaatgtttactatgtaacaaaattatatgatcgcaaaacaagttatttcagaaagt 

gctcccaccaagcaagccactttgtgaattgccattgcctcctagtggctgctgcagttattacaggtgaaaatatctag 

ctggaggcaaagggaggccttctgctggctggctggaaactgcccttaccaggactgaaaaagacatatttctgcacaaa 

attcacagcacccagcccaaagtctgaccagagtagtccccagggcatcagagtctaccatatcttcagaagactgacac 

tcacctcggacctggaagaagagtgaggtgttggatgatccaagaacatttgtggcctcacagcggtagctgccagagtc 

cccaaggcccagttctcggacctctaacttcagggagttggcctcagccttagcctggaaccgaccatgggatgggacct 

ggggacccaggctggtggccaggaggtgctccccatggaacaaggccagcaaggccagggggcggctgtccacagtgcag 

atgaacagagccatgtggccctggcccatgtctaggagggctgacagctttggacggtccgggggatctgcaggaacaga 

gggagctgaggccacccagcctagctcaccacattacaactgccacactatgtcccccacctcctgccacacacacagccf 

taagccatgctctttcctccccaaaccccagcccggcccctgcttctccccagaccacccctccaccctccaagtcccca 

ggctgcccatgccagtcacccacagaggacactcaggagcacgggagtggagagctgggcaccagcctcagtcaggatgc 

ggcaggcgtaaagggcagcatcagttctggccacgggcagcagtgtcacggtctccaggggaccctgggcccacagcacc 

ccatttcggaaccaggagaagttagcagggctgccagcagcttcccggctcacgttgcaagtcaagttggcttctgtgcc 

ctcctgaagtgtgtgtgatggtgcaatggccaggacagtggctggagagcaggcggcacagcttactgaccacccccggc 

ccccaggagccaggggtccagccgcccagcctggaaggagcaggaaggaggccccctggggtgcccacagggttggggag 

aaaagcaagctagctcacagggcagcctaggagaagtgggccctggggactgtgggggccctgggcagagagggttgcag 

tacagggaaggaggacaaggcagcccaactgtcccagctggggagtccttcctcagagaccagctgtgttgcctatcccc 

ggcctgaacagagatcatgggaaggagcagctccgctcttcctgaatgtggaggagggcagcgaagggcccccactgctg 
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cacagagtggttttgcctgattagatcctcctcggagcagaacagtccagagctccagcccctgcccccaggccacccat 

tccctgcctgactcaccctggccattgaaggtggctgaggtggaggcgttgcccagggcattgctggcctcacagaggta 

caagccctcctcttccagcaaagggttgtgaatctccacacgcagcaagttgggggctttggtgaccttcatgcgtgggg 

aacagcccccacaggtgctgcagccaccccctgatggcagggaagtggccacaacacggtccttgtggagcagctgcagc , 

ctggcgggggggtcgctgtccacacggcacaaaaggaggcctcgccgtccagccccggccccagcggcatcaaggtccag 

cctggtggtgaatgttggttgtcgaggggggtctgcagggaggaagaacatgggcactcatcccacggatgctccagggc 

cccacaagcctggctaggctcccagaatacactggacaaaggcagatcccgaaggctgcccpaagcagctgtgcagactg 

tgcactgcacaagggcagccagaccagggtgggtggaactcaagctaggctcatgctccccaagctaagctgtgccctga 

tgcctactgagtcttcccagaagaaggagcaccattttctagctcagacaaaggtgctgtatgggctggctgccaccttg 

cataagtcccatctgctgctgagcagggtgcccacctagcaatccctgggggtcatgctgtctgctcccaccctgcaccc 

aatgccttaccacggagcctcccctcaagatctgccctgcagcttatctgtgatctccaagccagctcctgtgatactag 

cgaatgcatagaatggcctcttcctgaaggcttctggagacagttggccaaaagagagcaatcagccaaggcccccatgt 

gatctcactcagacaccagaaaactttgcagaggcagctgccctctgacagttccgggacgtggctccggcacacatgtg 

tcaggggaagcctctggaatcaggcccttggtcttacacggagcctccccgcttggcggatgcagaggggctgtctgtcc 

ctgtgctgaggggctttgccttggtgtctatgggagcccagcacagcctgtggggcatgtgcacaaatagaggagtgtgg 

tgtcccagggctcacccatgaggacaaagcatggatgtggtgcccatgctgagcttgaggctgggacagaggagcccaca 

gggctggcattggagggtaaagacatggggaggtagagaaggccccagcctcatttcctaaacttcagccccacgtggaa 

agacccactgtggccaggtctgtgacccagccctcccgatatagaccttggggcagggcaggacagggctggggggccta 

gaggaggtggggtggctggtgggtcccggcaggaggcfcgcaacaggtggtggctgctcacagagcacagtgagaacagct 

ggcgaagaggggccactggcactgtggccgtcccgggcccggcagtggtatgagccggcgtcagtgctggaggccgcggg 

gagcaggaggctgctgccgggaccctcgtgaagcagggctccattcaggtaccaggagaagcgggcatcaggtgtggggc 

ttaggccgcttctgcagctcagtgtcactgcctgtccttccaccacctcggctgccgggctgatgaggagacgggcggct 

gcggggagaggaagaggctgggaagggtccctcctctcaaccccacatgctgccctatatagaaagccttccagggttct 

cctttccccacactactgtaggtagctctgggctttgtggtgcatcaggagggtttgccctttaatgccaaatcagatct 

attggtagtagatggctgcagcatggttgaagattcagagccagagcccaggcttatgtccaacacctggcttggctggg 

catggtagctcatgcctgtaatcttagcactttgggagactgaggcagaaggatcgtttgaggccaggagttccagacca 

gcctaggcaacatagtgagactccatctcaaacaatttttttttttttgagactgagtctcactctgtctcccaggctgg 

agtgcagtggtgtgatcctggctcactgcaacctctgcctcccaggttcaagcgattctcctgcctcagcctcccaagta 

gctgggattacaggtgtgccaccacacccagctaatgttatacatgtagtagagatagggtttcaccatgttagccaggc 

agatctcgaactcccgacctctggtgatccacccgcctcagcctcccaaagtgctgggattacaggtgtgagccactgtg 

gccagcttttttttttttttttttttttttttttgagacggagtcttgctccgtcagccaggctggagtgcagtggcgca 

atctgagctcgctacaacctctgcctcccaggttcaagcaattatcctgcctcagcctccctagtagctgggaccacagg 

tgtgcgccaccacacccggctaatttttgtatttttagtggagatggggtttcaccacgttggccaggctgatctcaagc 

tcctgacctcaggtgatctgcctgcctcggcctcccaaagtgctggaattacaggcatgagccaccatgcctggccacaa 

ttttttttttttaattagctgggtgtggtgacatgggccgtagtctcagctatttgggaggctgagatgggaggatggct 

tgagcccaggagtttgaggctgcagtgagccatgaacataccattgcactccggcctgggcaacagagcaacaccctatc 

tcaaaaaacaaaaagaaaaacctggcttgatcaattagctaccatgccctcaggaggagggaaggacagtgcacataccg 

aaagttggaagaccgtacttttcttttttctttttcttttttttttttttttttgagacagagtctcactcttgtcaccc 

aggctggagtgcagtggcgctatcttggcccactacaacttccacctcctgggttcaagcgattctcctgccttagcctc 

ccaggtagctgggactacaggaactcaccaccatgcccagctaatttttgtatttttagtagagatgggatttcaccatg 

ttggccaggatggtctcgatctcttgacctcgtgatccacccacctcggcctcccaaagtgctgggattacaggagtgag 

ccaccgcgcccagccagaagaccctacttttctatttggcttcccacatctgactgctagcatagagcctgctcccagag 

tttcataattaaaaaacaatgaatgcttctgagggactctccaagtttagggtcagggtaggtgcaaaaggaatgatgtg 

acctgttgtgtttccctttttcccttgacttccaggaagctctgccgttgggtcactgcacagcccctgtcttttatgtg 

gcgtagccagttagctcagtcctgcggttgagtccactagacttctagaaggaacagactggagcaggctcctcctcagg 

ctccctccactctccctggctgccagtgcccatcttaccattggcatggaagtccagggtggaggttgcatttccaaggg 

agttggtggctgagcacttgtactccccactgtcagtttcctccaggtctcggatctccaggcgcagggagttgggacca 

gaggtaccactgaagcgtgggctgtgatcactgtccccggaggtggaggccaggatatgacccccatgtgacagcaccag 

tgtggccaggggctcactgaccacagagcagtgaaggatgcccacaagtcccgcctgggtctccaggaaggctgtcagga 

ctggagtgagaggcgggtctgcgtggagacgagaggtgggcctgtcaccctcagacaagggcattccctggataccctga 

tacaacccgtgacctctgcaccgctttgtcccacactgccctgcgagaagggggtgatcccaaagtgcctgagtgcctgc 

tgaatacacttttgtccttggctgggctgggtacgtcactctgttgtcccagtcagtgttcaaggccaccctgcaagtgg 

ggataaaagccccacttcaaagatgagaaaactaatagagagacctggtgagggacagcacctcagccaggtgaccaaag 
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tgagcatcatcagcaatgggacaagctgacacagagtgcctcctgacagggcagagcagcatgtccgcggccttcccacc 
cagagggcatggtctcaggccattcaagtcaggcaaatgtggagtgagggacctcctcacaacagcaggcctgcactctg 
caagtttcaacgtcaagaatgacaaggaaagactgaggaacagtctcagactaacggagaatgaagagacgcaacgaccc 
aatgcaatatgtgaactgtgattgtattctggaccagaaaaaaaatggctacaaaagacagtattaggtccactggtaaa 
atgtgaatatagattatagcttagataacagtcttctatcagccaggtgtagtggctcatacctgtaatcccagtacttt 
999aggccgaggtaggtggatcacgtgaggtcaggagttcaagaccatcctggccaacatggtgaaaccctgtctctaca 
aaaacacaaaaattagccaggcaggatggtgggtgcctgtaatcccagctactcaggaggctgaggcaggaggagaatcg 
cttgagccccagaggcggacattgcagtgagccaagatagcaccattgcactccagcctgggcaacacagtgagacCcca 
tctcaaaaaaaaaaaaaagtcttctatcaatgctaattttcctgatcatgatcattgcattgtggatatgtaagagaatg 
atcttaggaatttagcggtaaagaggcctcatgtctgcaacttcaggaatataaatatgtaggtagatacataaataaca 
tatgcatatgtatatagagtgtccatatatgtataagtgcacatgtccacatagagtggatgtgcatacacacaagtgca 
cctgtatatatgcaagtctatatccatacatttatatgtatgtatgtgcgtgtgtgtgtctgtgtagacagcgagaaaga 
ttaagcaaatgtgccaaaatgttaataaatgggaaatctaggttaaaggtatataatcattatttgtctttctctgcaac 
ttttcataagtttaaacttccaaatataaaataggagggaactagggaggtcaaagcattgcccaggtgtgcagagctgg 
gacatgagctcaaggccacctccaggtaagtggccttgaagttcccatgcccaggaccccaacccttccttggggctcca 
ccatctggtcctgctctgactgtgtccagtgccaccccaccctggcctcttacggttgactaccacgctgacagggcccg 
agcgctcgctgccatggacgttctgcacctcacagaagtagaagccagtatcagccctagtggccaagtgcagccggagg 
gtatgggagtgggcatcctccagcaggacatggttcttgtaccagctgtagcggagatcactgggtgcctcgttgggtgt 
gttgcagactagtgtcactgtctggttctccaggatgggacctgctgggctcacctggacctcagccactgcaagggcag 
catagggagtgctggggggtcccagccaacttccagccccagccccatacagtccccgggtctcagccaatcagcctcaa 
tctctgcacatcctacaccacccgcctgctgctacaggggagcctcctggagtgctctgcatctctttgtcctgctcacc 
aggatccccctgaeaccccacccttcgtgggggtattgtcacctgtctcatggtaaggcagggctgggactccacacctg 
catccaggaagcactgggttatgccagttgctggccctgccctgccctgtctcccctccgtccctgaggcctgagctcct 
ccctattctctggccaatacgcaaccttcccaagaactcactgaagatgtggaggctgatggggggtgagaccaaagagc 
ccacgccgttctcagcttggcaggtgtagacgccagcatcgctccaggctgcctggggcaggtgcagcacaccagtcttg 
gtttggaggcgtaccccatccttgagccacttaatggaactgactgcagggtagctgctgttcacctggcaggtgagtgt 
gaccagctcacctggaaggatgttcctccccgaggggctgaggaggatcttcacacccctgggggcatctgcaagtcaca 
gtagggggtattgggtaaggtgcttggggagggcagaggatggcacacttcttcttgcccccctttaaaagctcagtcct 
aaggaagtatgcccagataaaacagcagtccccttcaatcctcacccagggcatgctctgtctgcccatgtccgtccctt 
tcccgccccctgcgcctgatgcattcctatg.cccattgagagctgatcatgtgacgcttggccagcgtccagcctacccc 
accagctgtagttttctgcttcccaatgctgtccattgctcctgctctatttgggatgagctccacacacagggtcatgg 
gtaccccactgctagctttagtggcctgtggaagctattggtaagggaccaa'ctacctagtgggagggggccaaaggcag 
catcaaactagctctgaaaatagttacccagtttgtaagcaagaggccaacaacacaaagaactgcatttccttaaatct 
tgttccaaagcctctctctgtgtattcaagtgttttattcttatttttttagaaagaggatctggctcagtcagccaagc 
tggagtgcagaggcacaatcatagctcactgcagcctgaaactcctgggctcaagtgatcctcctgcctcagcctcccga 
ggaactgggactacaggtgcaagccaccacatccagctaatttttgttttttaatttttttgtagagacagggtctcact 
atgttgcccaggctggtctcgaactccttgcctcaagcaattctcctgccttggcctcccaaagcactggggttacaggt 
gtgagccactgtatccggcctcaagtgtttaatttgtgccaggcactcttctaaatccttgacctgggtcatctccttta 
tttgtttttgttgttgttgttgttttttgtttgagacacagtctcactctgtcacccaggctggagtgcagtggcacaat 
cacaactcaatacaactccacccccggggttcaagcaatcctagtgcctcagccfcccttagtaactgggattacaggcat 
atgccaccacacccggctaatttttgtatttttagtagagacgaggtttcaccatgttggccaagctggtcttgaatccc 
tggcctcaagtgatccacccgcctcagcctcccaaagtgatgggattacaggcataagccaccgcgcccggcccatctcc 
tttaatttttatcataaatctgtgagacaggaaccatctattgttatcttcgtcatagactgagaaaacagagccaggca 
ggaaagggataaatcccacctccccagcatcctccagcacggggtaaatcccggggcacttgctgccaaccctgaagctg 
catgggagctcctgttccccaaccctttgtgtctccctttctctgcccagcctggcctcaaggacaagccccttcgaagc 
agtaggaggttgagggaaccatttaacgaagtccttggggctcagcagtgtctctccacttgccctaccctggaggccca 
ggagcagcctggttttgcatcaggagcaagggttccgtttctgtgggctggagaggggdtggtttctgtcaggagcaaca 
gatgcgctcagccacaagggtgtgtccccagacaactcacacttcacttggaggtgaatctcgctctgagccctgtgatt 
ggccacggagagctggcagcgcaggatccggccgtggtcctgccaggacatggccatgtggagggtctccaggtggccga 
cgccggtgggctcaaacttctggctgttgaaggtgacagagcgagcagggtcctggccttgccactgcagtctgacctgc 
tcctgcaggcatacgtagggagtggagcagttgaagtccacctctgtgccctcgagaagctccaccggggaggcaatggt 
gggcaccctgggctcctctgaggacagagacagcagtgctcaggacccgcttttgccacccctgagatccctcgccctgg 
aaaccccagctgaggagagagccctggggagggggcttttaggggaaggaaagacgctttctactccagccccacttggg 
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tgaatacaagggagaaccaggcctgggcctacgccggggctggaacagaggctgagactggctggggttagattcaggac 

aagggctggggctgagagccaaggggtccagaagcagcttgggaatccctcccggggggcagccaggccaccccacttat 

cacctgttactgtgaccaaggtgcctttcacatctgaccagcggttgacctcactgatctcgaagcggaagttgtaggaa 

ccagagtcctcgggctgcaggtccttcagcagcaggttgcacaccctgtgctcggggttccccatgaactcggtgcggcc 

gcggaagcgggcctccaccagcttggggtccgccgagtggctcaccacctgccgctggcccgagtagtcgtagtaccaga 

tggccgtgatgccgtcgggcacctccacgtcggcagggaagctgaagatgcaggggataagcaggcaagaccccttcaca 

ccctgcacgtcctggggactggagacgccccatgaggcctggcctgggggaagaacggcagggggacagaggggagggtg. 

atacaggcctcagggtgccacagagccctgcgacctgccccagagaaggtgccccagctgggctcccaaattctgccctg 

ccccgagaaatgcacacttagagcagcccttctcagtgccccaggggtcagtccactgcccgaggctgctcagaggtttg 

gtagggtggctcgaagacagatggcagcttcctgcccagctcctggccactgcccgattgggccctcccttgacctcagc 

aaccaaacatgtgacccaggcatagtctaatctgccaagggctggacttatgaaggctggaccctggaagacaggcacta 

ggcccgcaaagcttacctgctgggaagaatgaggccaggaggagaagcttgggcaagaagcccatagcaggttcttgtgc 

tgctcctgttgcctaagagggtggtgcgcactgcgctggctgggctcacaggggcctccagggacacctctgggcacttt 

agccccagcacctgctagaagtccgagcctgtgtccccacctcctctgctggccaacccaataagagggcagggctctta 

aagacctctgagtcagacaccagcagagagccaggaggccacgttcccagctcaggctgtgcccaggaatgccctcactt 

ggtggcctgcctcagaaaagcctgtgtgtcccttggccctatcccaagttctgctttcccagcccctcaaggataccacc 

ctaaggcagatgaacagctgtttctccctcttccccacttccctgccccctccccaccacccaaaccaacaggaactgga 

gcccagagatgcccagttactcactccaagacacccagctagaatgatggtttcttcctgaggcttgtctcctaccacct 

gccttactaactatagaccataatggggctttactgaacttgccgaagtgctgcctttaacagtcactcccctgctcaaa 

aaccttctgtggctccccattgcccgtgagatgtgaaaagtcctcatttcctgcccccagctctgtccccattccctgct 

ctccgcagacccactctgggggcagttctgtctgctcaagggctccctagctgcccagctctatctccaccacagataat 

ctttgcctgctgaaaccttattcaaccttcaaggagcagcatgaatttggcttccagctggaacttctcctttgaggttc 

ctgtagctaccccagagctatctctattttccttgccttgttttttacagcttgtgagagcccatttctcaggacctaga 

actgaagatatgtgccccatagcagtgcggagcctaccaggcattcagcaaaccccttagtgactaagagaggggtgagg 

tctttaggggttcagagctgaggttcagagttggagtggggaggtggcaaggcaagtctaggtttgaaaggtagcatgag 

agcgctgtggaacacataccccacaaatatgagctcaatgtgtgcggagtgtaccatatccaaaaaggcaggccctcaac 

catggagtgcccctggtcagggagtgtctaaggggtaccatagacctgagcccaaaaggaagagatgccagaaacacata 

taagtgaaactataaaactcttagaagaaaacaggtgaaaatcttcatgaccttggattaggcaatgctttcttaagtat 

gaaattaaaagcacgaaaaaaaaaaataggtaagttggacttcatcaaaatttaaaacttggccagacacagtggctcat 

gcctgtgaacccaacactttgggaggctgaggcaggaggatcactttagcccaggagttcaagacgagcctgggcaatac 

tgcaagactctgtctctacgaaaaattaaaaaacaggcctgtggtcccagctactctggaggctgaggtgggaagatcgc 

ttgagcccaggggaggggtcgaggctgcagtgagccatgattgcaccactgcactccagcctgggtgacagagcaagacc 

ctgtctgaaaagaacaaacaacagctgggtgcggtggctcacgcctgtaatcccagcactttgggaggctgaggcatgca 

gatcacaaggtcaagagattgagaccatcctggccaacatggtgaaaccccgtctctactaaaaatacaaaattagctgg 

gtgtggtggtgtgcacctctagtcctagctactcgggaggctgaggcaggagaatcacctgaacccaggaggcggaggtc 

acggtgagccaggatcacgccactgtactccagcctggtgacagagtgagactcttctgtctcaaaaaaaaaaaaaaaaa 

aaaaggatatgaatcaaaggacactatccagagagtgaataaaaggaggacaacccacagaatgggagaaaatatttgta 

aatcatttatctgataaaggactaatatccagaatatataaagaattcctacaatgaacaacaacaaccaccaaaaaaac 

catgaaatccaactccaaaaatgggcaaaacacttgaataaacatttcttcaaagaagatatataactggctgataagga 

catgaaaagatgctcaacatcactaggcattaggaaatgcaaatcaaaaccaaaccacagtgagataccacttcacatcc 

gttagaatggctattcacaaacaaacaaagcaacacagaaaacaataaatattggtgaggatgcgaagttgaaattcttg 

tgtattgctggtgggaatataaaatggttccgtcactgtggaaaacaattgggtcattcctcaaaaagtcaacataggat 

taccatatgatccagcaattccactcctaggtatatacccaaaagtactgaaaacagggactctaacagagtacaccaat 

gttcacggcagcactattccactaaaaggtggaaacaggtcaagtgtccatcagtgaatggatgtggataaacaaactgt 

ggtatgtacatacaatggaatatcaatcggccataaggaggaatgaattctaacatatgtgaaccttgaaaacattatgc 

tcagtgaaaccagccagacacaaaagggcaaatattgtagggttccaattacatgaaatatctagactacgtatattcag 

agactgaaagtagaataggatagaggtaaccaggggctgcagggagggggagctaatgtttaatgattgctgagtctctc 

agataatgaaaaagttctggaaatagtggtgatggttacacaatattgcaaatgtacctaatgtcatggggtgtacactt 

aaagacagttaaaacagtaaattttacattatgtatatttcaccacatacacacccatgttgccaactttgcaatcctcc 

ctggtcctaaatgctgacttggccaagtgaacgaggaggctggaagtggggacaggaaactcatgacctcccagctccca 

gcccatccgcctcaggggctgggctcagcagattccaatgactaccaggggtcacacctgggaagggggtgagccgaggc 

ccagggccagtcaggctgaccaggtgggacttagcctgctgcagaaggcagaaggtgccccagcagggggcacagtacag 

ggcgggattgggacaggaaggacaccgctccccaggggacccagccctctcgcaggctgctggagtggactgatctggcc 
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at ttatggaggcccaagggctcatctccagttctctaggaagccctaggcctcctcctcttctgggaagatgcaccccca 
gcctccacaccaggttcttggccactggagaatgatatagctggggccctgggacctggacacctcaccgtgaagaaaag 
cagcctgctgggcacactgggggtcagatgtgtccctggccacaggggatgtcagggtcagctctgctatggccagggca 
gctatcttgtcccagctcccctgttcctcccatttggggtcctgaaaagggcaatcgtgaacctgatggaagaatggtgg 
ggttctggacacagcgaccctggaacagggcgcgggggaggaccctttccaggaccactcccatcacataatgtagaggc 
cacctatgcttagcccggccctaaccccaaaggggtcagccccaccggaatccagcctattggctcagcctgtcaccaca 
aaggccagcttcagcccagataactgttctggaaacagaaagagcagggaccgctcagaaaggagatctctgtccctgtt 
tgaaagcctggagttgaggggacagtgccccgccccccgccccccgcaacttgggttgcagctgtggcctagtgagcacg 
cagcgccccctggtggtcgagggggaattgcgggtcccgggaaagggggcggtgtgccagcaacagggagcaggcagctc 
tgcagccctgaaccatccctcccttgggtgactcttttggaaatcattgttccccagacaggaggttcctgaggttcata 
cttgggtctccaagtcttgggtgctctgaagacaggattttaaatccccactcctactattggtatgtgttgcatcaggg 
tggcttgagctggcctggtacacagtgggcactcacgtttgctgcctgtttgagaccaagtgcctcaggaggtcttggag 
ggttggctggggccccaagtccctgacctctgattccagaggccaagtttagctggggaagaaagggcagaggcagtttc 
cctatggacagctaggcccgggtgtaggattcagtttctgtttcctgacaccaaggcttctccccaacttccccattggg 
ctagagaaggaagaacacagggtgacatggccagctggaggttactggcccacagatagggagtcagggtacggatgggc 
aattcctggagcagattatggtcaaaatagtggaaatccccaatcacaggccaaatgtttaattctcagagccatagaat 
ccataactaatgcttattggctttgacatgggcagtagaaatttcacacttcattccttaatctggctctaaatgcttct 
ggctggagtgctcactctccaaactgtgctggacagcaccagaagccctcctgtggacggacgaagtggtgatggatgaa 
gtggaattgtgctggggttagagtaaggagagattatcgttgggctgggctggggctgaggtcagggtgatgatcaggtt 
ggaattctggaagcaaattaaggctgggattggggtggacattgaggttgagtgatgggtgggggtaagggtgaaggttg 
gagttgggtgcaggtgatggttaagatacggtggaggctgggttgagatgaggatgatagagtcggagttgtggttaggg 
ttgggatgatggatggttgggattagatgagatgagtaaatggttagggttggggtgcaagtgaggtgagggtgagataa 
ggttggaataggggctggggctggggctgaggtagggtcagagcagggtgatggtcgggactgggattgggatgcaggtt 
ggaaggatttgggggtggtggaagggtttcagttgagtccatctttgattagtgtcttggacttgggttggtttggggtc 
caccactcgcacccagatggagcccccccccgacccctgcccctatcccgctcagccagtttcagcccagccccgctcct 
aatgctccactcaccctctgggcccagggaccagggacaggggtacctgctccacagaaggaagtggctgcggcggtgct 
ggacctgcggagaggagaacaggaaggacggccaagagctcctggtgcagctggctccccagggctctgccggctcaaga 
gagaaggatcccgtatcaggggctgcttcctctttcccaaagcctcagctctactgtccaacccagaggctggtcaggga 
ggcagctgcaggccttgcgcaatgccaaaacgggaaagacctccataggggaaggccctcggcaaggccagggacttagg 
gactccagcaagcagaagtgggaccgctgcaacgctggagcctccccaggcaaagtgagaaatggagtgggggactccca 
ttcaccagccaaatccacaccccactctctctgagccccttagggaggctgggggaggtggaagggggcttcctgcacac 
agctctcctccctgacacctgagagggaggcgcgcccagcctggggtggggatgcactacacgatgcccagaccaaggca 
gttattctccaagccattcaggagcctccctgtaccactgagtactcttaagaacccccaagaggcagtcccgtgttgtg 
ggggacataaggccccaatgtccaggacactcctcaggttttctctttcccctctcactgtcctgtacgttttttggttt 
ttggcttcttgttgttggtttttttctttttgttttgtttttgcttttgagatggagtctcactctgtcgtccaggctgg 
agtgcggtggcacaatctcagctcacggcaacctccacctcccgagttcaagtgatcatcttgcctcagcctctcgagta 
gctgggattacaggcatgcaccaccacatccggctaatttttgtatttttggtagagacggggtttcaccatgttgtcca 
ggttggtcttgaactcctgaccccaagtaatctgcctgcctcggcctcccaaagttctgggattacagaggtgagccgct 
gcacccggccactatcctataccttcacccccacGttgggacaggaaaggaaagcccccaccacagctagttactgttac 
attactgcagcccatcttattgagggcctagtttgtgcaggcacttgacatgtgaagcagatgttaattgctccaagcaa 
tcccagatacaagcaacaagactttttttggtttttgtttgtttgtttgtttttgttttttgagacagtgtcttgctctg 
tcgcccaggctggggtgtcctggtgcaatcttggctcactttggctcactgcagcttcgaaattctaggcttaagcaatc 
ctcctgcttcagcctcctaagaagccgggattacaggcgcttgccactatgcacggctaattttttaagttattttgtag 
agacggagtctccctatgttcccaggctggtctcaaacacctgggctcaagtgactctcccacctcgacctcccaaagtg 
ctggaactacaggcgtaagccaccactcctggccatttttttttttaatttcacaactcaaacttaattcaactcagtcc 
ctgcctacctgtactggtggggagctggccacagcaa.attaggcccctgtaccctgagggcaaggccacgtgtgcaggtg 
taaaaggatgtgaaaccctaaatcagggtggatccagaatcgcaggccatggtgccccaaagcagatgtctggtgacatt 
ccaccctgaaatgctcaggctacagagatattaggtctctatcactctgttcctctttatagctcctgtgtccactccta 
tgcttgggccatttcctctttcggccaaaacacaaagggttcatcccattacttcctccctcaacagctggtccggagac 
acccagctctaggcctgtggggttgtgacacatgggtaccaatccttcagtccactgggactctatacatccaccccttg 
gcttcatggtgggaaagcatcccttgactttgaccttagtcatatgacttgctctggccaatggatgtgcacggacatga 
cacaaactaagtcttgaaacgtacttaagcagtttgcctttccccctgaatttctgtcattgccatgcaaggggcaggct 
ccaactaacctgctggtccaaagaggatgacgaacacatgcagatgacctgaatcagacccatggattgaaacaaaactc 
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agctgagcccagcctacgtccaccagaccagtcaacctgtggatgcgtgaatttattgctggatgctgctgagaattttg 
tggctacattagcaagatgatacaaggcctaagtcccagaacaacacacccagaacttgcttacctttccttagcatgag 
gagagcaaagacttgtctaccttgattagtcagggagcactgcttcctgtcatttccttgagtatacagcaaactaggta 
aataaataaaaataactaggtaggctgggcacggtggctcacgcctgtaatctcagcactttaggaggccgaggtgggca 
gatcgcttgaggccaggagttcaagaccagcctggtcaacgtggcgaaaccctgtctctacgaaaaatacaaaaattagc 
tgggcctggtggcaggcgcctgtagtctcagctactcaggaggctgaggcacgagaatcgattgaacccgggaggtggag 
attgcagtgagccgagatcacactactgcactccagcctggatgacagagcgagactctgcctcaaaatattttaaaaaa 
tgtaatttctcagtaggtcacactgttacacattcatctaataacaattattctttttctttttttttttttttttttgg 
agatagggtctcactgtcacccaggctggacaggctggagtgcaatggcacaatctcggctcactgcaactttgacctcc 
taggctcaagtgatcctcctgcctcagcctcccaagttgctgggactacaggtgagtaccaccagacgcagccaattttt 
gtatttttttgtagagatggggcttcaccatgttgcccaggctggtctcacactcctgagagttcccagtcaagtgatcc 
acccaccttggcctcccaaagtgctgggattacaggtatgagccaccatacccagtgaattattctcgttccagatagga 
aaaccaagtcatagggaggttagagaatttgccaaagacaaaactttttggttgaaaaaaaataagttttgctacaagta 
tagaaaacaccaaataacggtgttttaaataaaatagaagtgtttcaccctctccctcaagtaagtgttggcatccaaga 
tgatatgacaactccacaatcatgaaactagatccctttttattttttggctcagtcattgtcaatgggctgcttccagt 
cattgtccaaagtggctgcttgcgctccagccactgtatctgcattcgggaaagcaggatggaggaaaggacaaagtagg 
ccatgccctctactttaagataactccttaaattgctgatgtggtgggtcactcctgcaataccagccactcaggaggct 
gaagcaggaggttcacttgaacccaggagcttgaggctgcagtgaactataattgtgtcactgcattccagcctgagtga 
cagagtgagatcttgtctcttaacaacaacaaaaaaggtaattctttaaattgtacactttattgctgcttatattccac 
tagaagctagaagttagtcacatggtcttaagtctttattctaacaggtgatgtgcccgagtgaatcctgggggcccagt . 
gctaagaagtagagggagatgaagccctgccctcctgtggagcaaccaggaccttaattcagtcattcattccttggacc 
tacccagattccagcctcaggcccctgcccacctccttgccagtatctctgcagagcctcctgcctcccccagagggtgc 
cagagccctctgcttcctcagccttcagacccacttgctgtgtctcaggtggggacaactcagtctatagaggcccaagg 
gagatctttgagacccatactgtagtcagcttggggcagatgggagtcatgtagctcactggctaaaagcctaggtcctg 
catagagctgggttcaaatccaggcatatccatcacctgctctacgatcccaggagtcactcaatgactcagaagctaga 
gtcctctggaaagcttgt atgaagagttcccagggcctggaacatcaaaagagctctgagaatgttggctctgctgttgc 
ttctcttattgggtatctgaaggccgtactctcctctctcctccacccgaggtgacctctactctcatagccacaccctg 
gaccctcactcattcagtcacctcacccagtctgtcatctgtccctctggctctcctctcctgttcatgcatcttgtact 
cacccacttctgtccctgagtgtttcaaggctctggggttttcagagacactgaagggacctccctcctcaacacaacca 
caaggtctaggtggaatgacccactaagggaccggctctgaagccagagacttccagggaaagtcaacaagcccaaggat 
gcccgttacaagaaagttaaaagacccatgtacatgtcctcccgttttattccctgctcagggtctgggcatacagtgga 
acacatgcagtccccaaaggacaccgtctgtacagagtcagatggagttaagaacatttagccggctgggcgcggtggct 
cacgcctgtagtcccagctactcgggaggctgaggcaggagaatggcgtgaacccgggaggcggagcttgcagtgagccg 
agatcgcgccgctgcactccagcctgggccacagagacagactctgtctcaaaaaaaaaaaaaaagaatatttagtctgg 
gcacggaagcttatgcctgtaatcccagcactttgggaggctgaggtgggcggataacctgaggtcagaagtttgagacc 
agcctggccaacatggtgaaaccccatctctactaaaaatacaaaaattagccaggcatctgtaatcccagctactcagg 
aggctgaggcaggaaaatcacttgaacccaggaggtggcggtcacagtgagccaagattgtgccagtgcactccagcctg 
ggtgacagagcaagactccatctaaacacacacacgcacacgcacacatatttaaaccaccacaccaacatctagttcaa 
gatggtggactgagaacttgtctctgccattcctggcccatccaataccactgagagcacagtaagcaaagggaaaagga 
gacagaagggctgggaacaggatggctgggggatgggaagtatccactgcacaggattttgatttaattctagaagatag 
aaagagggaggatcacgttcaggaacagatgcgggtaaaggaaaccagagccaaagcacgctgagagaaagctgccccag 
aggccggagcagaagtggactctctacagggactcaatacaccccaaagggttggtagctggcacacgtacctctccacc 
cccacatgtaacactgcagggcagaggaaataccctaggtgagttccagtatcaatcaacctgccctttgttcataaaga 
tgaattggttaccaggtattaccagacaggtgaggaagaccaacacaaagagaaagatccagaaacaaacaggccaggcg 
cattggctcacgcctgtaatcccagcactctgggaggccaagatgggtggatcacctgaggtcaggagttcaagaccagc 
cttgccaacatggtgaaaccctgtctctactaaaaatacaaaaattagctgggtgtggtggggttctcctgcctcagcct 
cccgagtagctgggaggctgaagtaggaggttcacttgaacccaggagctcgaggctgcattgaactatgattatgtcac 
tacattccagcctgagtgacagagtgagatcttgtctcttaacaacaacaaaaaaggtaattctttaaattgcacactac 
attgctgcttatattccactggctagaggttagtcacatggtctcatgtctttattctaacaggtgatgtgcccgagtaa 
atcctggggccccagtgctaagaagtagggggagatgaagccctgccctcctgcttgaaccctagaggtagaggttgcag 
agagcggagatcatgccactgcactccagcctgggtgacagagtgagactccatctcaaaaaataaaaaataaataaata 
aataaataaataaataaataaataaaaacctacagcaaagaacaaagagctattgcaccgttactgtgggcctggcagta 
ttatgacaacttttttttttttttttttttttggagatggagtatttttctgtcacccaggctggagtgcagtggctcaa 
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tctcagctcactgcaacctctgcctcctgggttcaagcgattctcctgcttcagactcccgagtacctggtattataggc 

acatcccaccacacccggctaatttttgtatttttagtagagaccgggttacaacatggtttcaccatgttggccagtct 

agtatcaaactcctgacctcaggtgatccacccgcctcagcctcccaaagtgccaggattacaggcatgagccaccacac 

ccatccagtgttctgacaattttatacttattaactcatttagcaaccctctgagctagatgctattgttatctccactt 

acaggaaactgagtcacagaatggtacagtaaaataaccttgaccgaggtttacatggctggtaagtggcagagaaatga 

aacttgaacccaggcaacctgtttcctgagttggactctgaactactcagccatactgcctatttatttatttatttatt 

tacttacttacttatttattgagatggggtcagccgggcatggtggcttacgcctgtaatcccaccactttgggaggctg 

aggggaacagatcacttgaggccagcagttcgagaccagcctggacaacaaggagacagggtcttgctctgtcacccagg 

ctggagtgcaatggtgcaatcatggttcactgcagcttcgacctcccaggctcaagcgatcctcccacctcagcctccca 

agtagctgggaccataggcacccactatcatgtccagctaatcgaaaaaaaaaaattatatagagatgggggtctcacta 

tattgcctagactggtctctaactcctgggcttaagcaatccacccaccttgccctcccaaagtactagaattacaggtg 

tgagccaccgtgcctgaccaatacttcctctttaataaactaaacaaaaagtgaacaaaccaatcccggagggaacagat 

aattcaaggaatacaagaaaacttatgaaaagaaagattctagtgccaatatgttcataaaacaaaaaaaagcagctatg 

aaaagaaagcaaaaaataattcttggaaatttaaaatagaattgctgaaataaaaagaaattcaatagaaaagggtgtga 

acatctgtggttttggtttccaaaattagttcactcttctttgggtaaaaataccctgattttcctgctgtattctcaag 

ccctgtgggttggtggaattgacacctcttccatttactgcacgagagaagcatgttgctgctcacagttcattatgagg 

agccagcaggtaaagacaacactgagggcagatgaaagatatatactgaccctgggtccttgtggacatcattgagtccc 

tggatcaagccttacctgaagctaaagggatgagtgcttctcactgttaatagccactgatcaaattccatagaaggact 

ccaattatccttgcctcagttgtgtgtccagccctcggatgagctaccatcaagggaaactgccagaccttggtgacaag 

cccacccactgaaaatggggaaaaaacccaagtcacatcattgaactgaaagagctgcagttccccgaaagacaggagag 

aaggaatgctggataaacacaacaactacttcatctcactgcacctgcaaagaatccaggcacaggaaggtgggcacagc 

aagtagtctttctggttgacataacaaagacaaca.tctgtttcttctctcagaaagaagtaagaaaggcaattggagaac 

cagaaacgacaaaatctattccaggaccaaatagaaagaaaacacatctgaagcatgattttgaacaattagtggggtgt 

aagaaaagagaatccatttgaccttgacatctgaaacagcaaatattctccatcaaggcactttagggtagaggaaatga 

tactatgtttccttctcaaaggagctatctggttacgtggttctgcagtaaatgacgtttatacaatcataacatccgtt 

ggttttcaattttcagaatcaacctgaagataaagcatggaagatttcattataattaccgaacagcatgcaaatgttac 

aaactttgaccatgaaaatgtaaaaaagagcccaggcgcagtggctcacgcctgtaatcctagcactttcggaggccgag 

gcgggtggatcacaaggtcaggagtttgagaccagcttggccaacatagtgaaaccccatctctactaaaaatacaaaca 

ttagctaggcatggtggcacgcacctgtagtcccagctacatgggaggctgaggcaggagaatcacttgaacccaggagg 

cggagtttgtggtgagccaagatcacgccgcggcactacagcctggtcaacagagcaagactccatctcaaaaaaaagaa 

aagaaaagaaaagaaaagtagacagcagaaattagagggagatggcaggtgacctggggctgggggaggcacaatttttc 

ttacatagtgatagggctcaagaaattctttctaaaattgatgtatgaggaacgaaattttaaatatagattgtttagaa 

ctataagtagcaccaacacaggactccagtaatcacacagcaaacaaaactgagaaaacagacatgggcggggagatggg 

aatgggtgatttccattccctgctttaatgatgaggtgtcagtagatgctctcttgagttactaaattgagaaacaaaga 

taaagtatattgtttagaggagtgatgttcaaactccagaaaaggctgggcgcggtggctcatacctgtaatcctggcac 

tttgggaggccgaggtgggcagatcacttaagcccaggagtttgagaccagcctaggcaacatgatgaagccctatctct 

actaaaaaaaaaaaaaatacaaaaaattagcaaagcatggtggtgcacacctgtagtcccagttattcaggatctcacta 

ctgtactccagcctgggtaatgagagtgagaccctctctcaaaaaaaaaaaaaaaaaaaaaaaagaagaagaagaagaaa 

actccagaatatataaaaatgaaaagagggcggaagagtggaactgcggggtagaaaattttgcattttgtttcatctct 

ctgtatatgttgaactttttttttaacctacatgtactacgtcatatccacagcccccaaccatccaccaggcatcaact 

gctgtgtttatatggagggttgagcaatcattcctgcctccttccagttcgtcttcctgtactgcagaggctagaaaact 

aaatttatatcacccagattcccttccagctaccttaatgctaccacctcattcagccatcattgattctccacttcacc 

aaactggtcaataattgtctcttctaaaaatattagaattgggactaagagacactacagctgacccttaaacaacatgg 

gtttgaactgtgctagtctgcttatacacagatttttaaaaaaataaacatattgaaaaaaattttggagagatgtgaca 

atttgaaaaaactcacaaaccacatagctagaaatatcaaaaaaaaaaaaactgaaacagagttaggtaggtcaggaatg 

cataaatatatatgttaattggctgtttatattattagtagggcttccagtcaacagaaggctattagtaattaagtatt 

tgagtcaaaagttatacacagatttttgatgatgaggaggattagtgcccccaacccccatgttgttcaagggttagcta 

tatttgaaatctgctggtagctggaattgggcaactaaattcacgagatgtggaacggtcgtatacatggagaataaata 

aataaataaataaataaataagtatttgttggcctggtgcagtggctcatacctgtaatcccaacactttgggaggccaa 

ggcaggcagatgacttgaggtcaggagttcgagaccagcctggccaacatggtgaaataccgtctctactaaaaatacaa 

aaatcagctgggcgtggtggtgtgcacctgtaatcccagctgcttgggaggctgagggatgagaattgcttgaacctggg 

agtcagaggttgcagtgagccgagattgcaccactgcactccagcctgggcgaaagagtgggactctgtctcaacaacaa 

caacaaaaaagcaattgttaaaagaataagaaaacaagccacagattgggagaatatatttgcaaaacaaatatctgatg 
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acccaaaatacacaaagaactcttaaaacccaacactaagaaatcaaacaaccccattaaaaaatggacaaacgggccag 

gcatggtggcggtggctcacatctgtaatgccagcactttgggaggccgagcagggcagatcaccttaggtcaggagttc 

tcaactagcctggccaacatagtgaaaccgtctctactgaaaatacaaaaattagccgaccgtggtgacgtgcatctgtg 

gtcccagctacttgggtgtctgcggcaggaaaatagcttgaaccagggaggttgcaatgagctaagatcgcgccactgca 

ctccagcctgggcaacacagtgagactccgtctcaaaaaaaaaaaatgggcaaaagatgtgaacagacacctcatcaaag 

aagatatacagatggaagataagcatatgaaaatatgcttaacctgatgtcattaaggaattacaaattgaagcaacaag 

gtaccactaaacccctattaaaatggtcaaaatccagaatgctaaaaacaccaaatgctggtgaggatgtggagtgacag 

gactctcattcgttgctggtggaaatgtaaaatggtatggccatttttaagacagtttggcagtttcttacaaaactaaa 

catagtcttaccatacaatctaacaattacactcctagatagttacccaattgaggtgaaaacttatatccagggccgag 

cacagtggctcacacctgtaatcccatcactttgggaggccaaagaggaaggattgcttgaggccaggagttctttcttt 

ttatttatttatttatttattttcttttgagacagagtttcgctctgtcacccaggctggagtgcagtggagtgatctca 

tctcactgcaatctctgcctcccggcttcaagcgattctcctgtctcggcctctgagtagctgctgggattacaggtgca 

cgccaccatgcccagctaatttttatatttttagtagagacagggtttcaccatgtcagctagactggtcttgaactcct 

gacctcaagtgacctgcctgcctcggtctcccaaagtgctgggattacaggtgtgagctactgtgctcagccaaggccag 

gagttccagaccagtcctggcaacacagtgatactctgtctctacaaaaaaaaatttttttaattagtcacatgtggtgg 

cacacacttgtagtgtcagctactggggaggctgaggctgaggatcacttgagccccaggagtttgaggttgcagtgagc 

cacgattatgcctctgcactccagcgtgggcaacagagtaagagaaagaaagagagagagagagagagagagaggaagga 

aggaaggaaggagaaaagaaacgaaaagaaaagaaggaaggaagggaaggaagggagggaaggaagggaggaaggaagga 

agaaagaaagaaaggaaagaaataaaaaaggaaaagaaaagagagagagaaaccttatatccatacaaaaccctgcgcaa 

gaatgtttatagctgctttattcataattgccaaaaactggaagcaactaagaagctctccaatgggtgaaaggataaac 

aaactgtggtatatctgtgcaatggagtatcattaagtaataaaaagaagacttagcaaaccacaaaaagtaatatgcat 

actaaatatgcatttagtaatattaatatgcattaagcaataaaaaaagacgtgtcaagccacaaaaagtaatatccata 

ttactaaatgaaagaagccagtctgaaaaggctacgtgctgtgtgatttcaactatttgactttctggaaaaggcaaaac 

tacagacagtaaaaagatcaagggttgccgggggttctggggacaggaaaggatgaataggtggagcacgggaattttag 

gtcagtgatattattctatatgatactataatggtggatccatgatatgcttttgtcaaaacccatagaaagtacaacac 

aaagagtgattcttaatgtaaactatgtgctttagtcaacaatgtattgatattagttcattaattttaacaatgtacca 

cactaatccaagatgttaataatggggaaactgcgtgtgagggaggggatacatgggaactctgtactacagctcaataa 

ttttataaatctaaaactttttttttaaataaagtctcttagaaaacaataaaaataaaataaaaagtcaattttttttt 

ttttttttgagagggaatcttgctctgtcacccaggctggagtgcagtggcacaatctctgctcactgcaacctctgcct 

ccttagttcaagcgattctcctgcctctacctcccaggttcaatcaattctcccacctcagcctcccaagtagctgggac 

tacaggcatgcgccaccacgcctggctaatgtttgtatttttagtagagatggggtttcgccatgttggccaggctggtc 

aagaactcctgacctcaggtgatqcgcccgcctcagcctttcaaagtgctgggahtacagggttgaaccacatgcctggt 

ctaaaatgtccatttttaaaaggcagtctggctggggcagtggctcacgcctgtaatcccagcaccatgggagaaggctg 

agcagggtgtatctcttgagcccaggagttcgaggctgtagtgtgctatgatggtgccactgcactccagcctgggtgac 

agagtgagactctgtctctaaaataaataaataaaaatacaataaaattttaaaaggcaatctgcagcaagagaagatga 

agttagcaggagagcaggacagaggagagtccatgaacaacagggagaggaaagtggtcatggtggcagctcccaggctc 

ctggcatatgttcaattccctgttcccagccctcaggaagcccagatgtccccacctgcccctacatacaaaccctggat 

ccttgacatcaacttctcctttccttcatgtgctctaatgagtttttgttacttgtggaacatttaactaacatcattaa 

ccaagtggacctgtctcagagtggtgttatgagaagacagcagccaaaagacagctgcagccaagcacagtggctcatgc 

ctgtaatcctagcattttgggaggccgaggtgggtggatcacctgaggtcaggagttcgagaccagcatggccaacatgg 

cgaaaccccctctccactaaaaatataaaaattatccgggtgtggtggcgagcgcctataatcccagctacttgggaggt 

tgaggcaggagaattgcctgagcccagggggcagaggcggcagtgagccgggatcgtgccacttcactccagcctgggtg 

aaagagcaaaactctgtctcaaaaaaaaaaaaaaaaaagacagctgcaacaaatgtcaagttctgtgtgttttcttttct 

tttcttttttttctatttaattaatttattttagagtcagagcctccctatgtcacccaggctggagtgcagtggcacag 

tcacagctcactgtagcctcaacctcctgggctcaggcgatccttccacctcagcctccttcctagctgggactacaggt 

gtgtgccacgacatctggcttgtgtgttttcttttctttttttttgagacggagtcttgctctgccacccaggctggagt 

gcagtggcgcgatcttggctcactgcaacctctgcctcctgggctcaagcaattctcctgcctccgcctcctgagtagct 

gggaatacaggcgcacaccaccatgcccagctaatttttgtatttttagtagagacggggtttcaccatgttggccagga 

tggtctcgatctcctgaccttgtgatccacctgactcggcatcccaaagtgctgggattacaggcgtgagccactgcacc 

ctgcctggcttgcatgttttctgacatactgtcaaaaggatactcatactaaatggcaacacattctcaagccccttcct 

tttcttctcctatctgctttaccacacaccatagctgcttttaccgttttctccttaaaaactcaaaaaaaccttccccc 

aacctatctatccacttcttgtcctcccctcaaacccactccattttgattctgcatcataaaaaccaagcagagttctg 

gggaggcagaagcctcctcttatgatattggaagggggtggatgaaattgagcacacaaagccagaatcctcttttgtgg 



FIG. 7 



WO 01/78894 



PCT/US01/12245 



44/157 



aaagatgggggcagtgggcagagggaagcaggctcatctctttccctctcttcccttcccttctctttcacaccacacgc 

tccgcctgggtgagctcatcgtcctcgtgtcttcaatttccacctccaggagatcgactgccaaatttctacccgcaatc 

ccaaactccactctgagctccagacccatctactaattgcctagtcaacattttctctgtgggatcgaatcagataggat 

tatattctgctgtgactaacagagactgaaaattcagagtcctaaacaaaagactttctttttcttgtgctgaaaagtct 

ggaagtacacagccctaggctggaatagggatactgctggggggctccgcctccttgcaggctgcccctctgctatctcc 

agggtgtgtccctcaaccacactgtccaaagtggtgactgaaggaccagtcctcccactgacattccaggcaacagaatg 

gagaagaacaggctgaggaaggaggaaccacaggcgccctccagctgtttcaaggaaggttcctggaagatgctaaataa 

caattccgtttatatctcattagccaggcttagtcacgtggcaacacctagaagcaagaaaggctgggaaatatactgtt 

tctgctgggttaccgtatgcatgactaaaaagagagcgctctgttcttaggaaaatgagaatagactgggttgggggagg 

ataaccagcattggctacactgcagccccacaacctcctcatgcttgatgtggctgaaataaaaagagcatctcccccta 

acatttcctttcccttagtcttgttccttttctgtaccccataccctagcagaggggcaattctgtggggtgggctcact 

ggcaaggggaggtctcacccaagctcagtcaagggtgagcagagagaattgtacaaggaagctgaatttccagctggtct 

ggaggaaacactgaaaaccccaaacagaagatgtccaggtggagcaaatgcagctgagagttcttgtccaaggggacaga 

atgtcagcagagtaggacacaagggcaggtctctactgaaagcaccaggggcaaagtcacccagcccttcaaccggctgg 

ccaccaagcacatcggctccactctgcctcctgcccgtgccaccctttgcacttttgtctgtaccatctcctccaccagg 

agtgcccttcccgctcctccatgtggtatactctcagccacccctcaggctcagatccagagccacctcctccaagaagt 

ctccacctgtctgctgggaaccctcacatcacactgggagacgaagccaaaagtgggggggtcagtacagaggtgtggga 

ggtaggtccaatcctggctggacctggctttgggtgccatttgggacagggtattggggaggttcagcaggaaggtggga 

cctcaaggctttctttttatgaaggaggaggaaccagctgttgacagagcagcaacagcagggagccaccaaggctgagc 

cttgaaccctgggccaccaggctcctgcaggcaactgggagcagctcagcctctcacagtcccatttccagggaggaaga 

tgatgcagcctgaggcaattggccacctggaggaagttcagctgggtggggcaggcagaggaattcctggaaggaagctc 

acccctcagcactggggtagagggacactagagacaggggtggcccaggagctgcccacctcctatccccatgaataagc 

aggtgggcccgggtgaccagctgggagccacccctgggtgctcaggtaccatgcttctggcccagcctcctgagctgtgg 

cacaggcaagagcaccggcccacggggtctgcatggggcagtgacctgttctctctaagcttcagctttctcatctgtag 

aaagagggtgaagagtccttccccaccatggttgtggtgagggtaactgaagtgtcagggtaggggacccagtggggcac 

cagcacacctgaggttcagtcaacaaggcccactggtgaagaaaacctcgcccaccctcccgtccctcctgaggcaggtg 

gtggaggcacatgccctctccattgtgtgtgcatgagtgtgtgggtgcatgcatgtgtgtgcatgcatgtgtgtgtgcat 

gtgtgtgcatgcatgtgtgtgcgaatgtgtgtgtgtgcgcacatgcatgtgtgctaagctctggagaacaagcagccagg 

cttgggcaggagtccgggggcaagggggggacatgcaaagttctctccacagcccctcagtgctaacaggctgaggtggg 

ggatgacatgtcttagaaacaaacttcaccctctgccctaaaactgcctgctggtctgccacatcgggccaagctgtggg 

tacttggaaagagggccctgctcctgccttccaaacccaggaagtcttgaaatccccatggaggggacctggggccaggg 

gactccccaagcagacacagcacccacacaagggtctaagggacccctttctccccaaccgcctgaggtagggatggatg 

tggacaagcagctcagggctggggctggcctaggtggaaggttcagaagcacagaaggcaggacgtgcccaaggggccgg 

acatggggttggaggctaggggctggaaaaagagtttgggaggtcagccaagggcagtccacgccttggctcttctctgg 

gtcatggggagccacggagagtataggagtggtgatgggatggccccagagttgtgcctttgaaagtcacctctggttgc 

agatagagatggactgggggcagggctgagaggaagggcatctcatgagaggggaaatccctgaccacagtccagaggaa 

aacagcgggcctggggagaggctgctcctggggaggacctactcagaggctgattcttcctcgtgctaccccacccaggg 

agatgcaggaagctggtgaggtcagtgggtgggcccacaaccatgctggagaggagcaggagggagacgggagggcacag 

agaggtagagggcagggactctgaggaccctcatccatgcccacctggacaattgaatcgagcttcctctagtgtggaat 

gtgggctcactggggctatttgctatccttctctcatgtcagactggatgcggcccaagggtggggtctgtgactccctc 

atcagactgaggggggccgaaatgtgggggtgggggaaggagggagcttccaaacaggaaggtttggagggcgctggcac 

tgaggacaggggatacgccccccatgagtcttcacccctgctcagagcctgctgccgtggccctcagagacgagcccatc 

tttggagctggagttggggacagggaaaccaggtggaagggaaaaagaaaaaaggggcaggcatgagtgtgaggggccag 

accctgcagaccccaggacatgcacagcagacacacactccaatccctcctggctctgctcagcctcaaacgaggtgcag 

gcctgcaggccccaaccaaggacccctgttacacacagacacaaagacacaagatgcagcaacacacacacgtaatacag 

aa'acacagtcacacactagcgcacacacagacacagtcagacaatccaagaaacacactaacacatgaacagacatataa 

gcaaagtcttacacatacaaagacacacttaagcacagacacaaacacaccaacagafcgcacagacggacacacatagac 

gcacactgggaggcggcctgtggagactgctgattggataccatgttggcttggtatccagaattgttggtgcagcgcac 

tccccatgcctgacctcattaaggaggccctctggggcagtggctctgagaagaaaggcagcaagcctagggaggtgaca 

ctgtagctggggagcattttggctgggcagagaggaggaaggagagcatttcagggagggaaaatagcccttgcaaaggt 

ttggagtggaagagcaaaagggatgtcccaagaactgtgcctcatttgatggggctgtggggcccttctctgacgggcag 

ccttcccaggcacaaagggaggtggactgagaagcatctggaacttctgtgggggcacatggggcagaggggaaggggaa 

caggcagaacgagt cage tgcctttgaaagctccacgagggacctagcagaaattcatt cat tcatccgt tea ttcattt 
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atttatgacaggttctctctctgtcacccaagctggagtgcggtggtgtgatcacacttactgcagccttaaactcctgg 

gctcaagcaatcctccaccccagcctcctgagtacctaggactacagacgtgagctactacacctggctaatttttaaaa 

ctttttagagacagggtctccctttgttgcccaggctggtctttgttgcccagccagcttcaagccatcctcccacctca 

gactcccaaagcgctggattacaggcatgagccacgggttcccagcccagaaagagaccttcaccccacaaaccccctgt 

ctggaggccaagtcccagcatccccagaagggccccggttaggacaagaaaaaggctgtgtgcctcggaggagaggctgg 

gtgccccctggtgcggagtggtccccttgccgggcatggggcctcagtctgtgagactccgcgcctccgtcgtctgaact 

tgagcaaagacagagctaggcctagcctggggacccgagggcctgccctcatccgcaattctgcctcttttctggccact 

gaggaccgggctagggaagctgtaggggcagtaggggtgggaggagagagggttccgggtcaagggactggaggggtctg 

gggcagggaagtggaggggtctagggcgggcccttccccctccgctgttgggccctcgctgaaggggccgaggctgagcc 

ccaaggtgggcctgtgggggcggggcgggggagcccctcagcagctggaccgcaggaggccgaccagggtctggaactcc 

tcttcccctccctccgccgccccctctcccttctcccaccccctgggctgctaggggagcgggggaggcgcaggaggggc 

tcagggaggggccctggacgcgggaccaggctgggcccctcggcggaggcccgcgcaggcaggccctgccgctggcctcg 

cacatctgggccgggagcgcgggccgacccggcggcgcaggcggcgcggccatccggcctgggggagggggcggcggccg 

gggtgggccggccccaagaaggcggaaggcctcagggccgggcgcacgttgagggctgcgaccgccgcagcgggcacggc 

ccaccgagctgggggcccgggatcgcggcggctggacggggctggagctgtcgggagggcggaggtgagttctggggcgg 

gggctgccgggcgccccgagtggagaaaggcgaggaggtttgccgtcccgggctgtcggctgagaccccaccaaaaacct 

ccagactttgtctggtggggacagatctgcaagcccctctctgcagcgtggctgcgggctcgggaaggcacttggggcag 

cgaccttggtctcccacctgggcttcgggggaccacctcccctgtccctcgcacagctgctcagaggcttaggctggaga 

aactcctgtgtcttaggggctggcaggactggtgtgggctggggcctgctccagggagccaaactgagggacccggtgcc 

taggtctgaggtggaaacctgggagttgagtgtatgtgtgtgtgttggggggtgcctcaggggtcccaggccctgtggtc 

ccgaagcgcaacagagtactgagtggcccaggtgccagggtctgaggctgggacctggtcttccgggccagcttgaggtc 

caggtggggtggggtccaggttcccgctagggcagtggagcctatctgtgaggttagcgtgtgttgatgagagagactgg 

aagggagtgatcacgaaactagggaccctccggagagcagacgcagcgcaggaatcccaggccagaggggtaatgggggg 

ctgtgggcagagcatgggaggggctctgttgtttgtgaactggctgctcccatgatgaggacccaagggggtgcggcagg 

atgtatggaaagcaaagatgaggacagggaggggccatgggaggccatttctgtcactccctgccccagcgcaggatggg 

gcagccacctcactcagccacacatggtcccatcttcttaagggctcctctccagaacagagtggggcagccctttgggt 

gccgtttctgatctgtgctttgggccctggttggcccagtctgtggcttggaagaaacgcctgtggcctcacttagcccg 

ctgacggagtcctgttccattacagggctggctgggtgctgagcagagagcagcaggcggaagaggcagggtcctagtga 

ggaggggagctggaatgacctgggtgggctctttgtctacacaggcatcgtgtggagccctgaaacccagaacactgcat 

tcccatagtgctcctggggacatgagcgcccagctgaaacaccccacacaggaagagatcctccgaggattctgtcaggc 

aaagggttgggcccctgcaaggtggctcacgcctgtaatcccagcactttgggaggatgaagtgggaggatcgcttgagg 

ccaggtggtcgaggctgcagtgagccgtgatcgtgccactgcactccagcctaggcaacagagcaagaccctgtctcaaa 

aaaaggagcgggagtgggggcagggttcgtcctaggctattgtaagaccaggggatactctgggatgggagtctctggtt 

gtgagttcctgctgggccacaccctatcctggtccccagtgactaatcggccctgcctattgggtagaagtcagggccag 

gggtcagggtgaagctgatgactagaggtcaatatgaagccaagagttaggggtcagtccatgggcagggttagggatca 

ggctggggccagagtttggcatgtgcttgtagccaggaaatggagttcctttctgtgtaaatcaaactgtcccactcaat 

tactctcaaacccataggcaccaccagtctgaaattagtaggcccctgaagcggggctcagcctcccctcattttcagct 

gtggactctgcccagacctagctttcgggagctgggcttgtcccgtgttctgtggtcagcactctccttaagtgtccctg 

ggtgtgtgtgctcgcttggcagtcagccacctcttggcttacctccacctctcggggaatccatgggatcctccttcttt 

aacacaattttttttttttttttttttggagacagggtctcactctgtcacccaggctggagtgcagtggtgtgatcatg 

acttactgcagcctcaaccttccaggctcaagcaattctcccacctcagcgtccaaagtggctgggactctaggcatgca 

ctaccacgctgggctaatatttgtatttttcgtagagatggggttttgctgttgcccaggctggtctcgaactcctgagc 

tcaagcaatccacccactttgacctcccaaagtgctagaattacaggcatgagccaccacacccagcccttaacacaaat 

tgatttagttcctacctccttccttaaagaactcatgatgactagaatgcgattctcagtaagaccaaaatatctaacaa 

gaaaattcagaatctgcatcatgaagggaaagaacactgctcaaacctgcccaactccctgcttttatcaataccttgca 

tatctggacttgttaacataagccctctgtgtcacccattctacaggaaagggcactgaggcacagagatgtggcatgat 

gcactcacgctaggtgccataataggctcttgccatccctgaaaccccagcataggggcactcagcatatcgcacaatga 

gagacggtagataactggacaagcttccgatagctgggaagagtataggcccatggtttggatgtggaatccagtgtaga 

gcgaggactcagtctttttttttttttttcttgagatggggtctcactcttctcgcccaggctggagtgcaatggcgcga 

tcttggctcactgcaacctctgcttcccaggttcaaacgattctcctgcctcagcctcctgagtagctggaattacaggc 

gtctgccactacgcttggctaatttttgtatttttagtagagacaaggtttcaccatgttggccaggctggtctcgaatt 

cctgacctcaggtgatccacctgcctcagcctctgaaagtgctgggattacaggagtgagccacggcgcccggcccagag 

tctttttttttctggtggttctaggttgtttctcctccaggtacaatagcctcggtccagaaaatccattcat teat tea 
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acaaagtggttccccagtgcttagcacagagcctggcatgaagaatctcgataagtgtttgctgaagaatcagaacaaca 

tacaaataaggctgccagctcctccctcgagaccttgatccagtagtatgctggttaatggtttccaatcagctctggaa 

gaaggataaaatcctgatttatggtgtttgccaatttctgtcttgtaaatactcccagccatgactgatttgaagctacc 

aatgtgatgtcattgaatgcagaattgggaagagatagaaagaatcagctcgcatgagctggtgggacccagctcagcat 

cccactccatcatcctgagctgggctctgggctcctgcccctggctcctaagaaccctcccctccccctctctgtgcttt 

cccttttctcctcaacagccagctgctatatgcatgcccacaaccagcacccatccttcttctgattctgattcctgagc 

atccaactccaatggtacagaggacagcaagagtgtctggaaatctccatggtgctggccatgcaggagaatcctggaca 

tacaagagtgtgtgggagtccaggtgtctgtccacaatggggctggtttgtgtgaactgtgtgagcaagggtgtgtgcat 

gtgtctatgggcatgccagagccagctctgtgtctgcctgtgttctatagcaaaactttcagattgggccgctggtgaca 

gagaaggaatgcattggccctgcgtagcctcctcctttgtttagaggagcagccaggcccaccaatgagaacatagtgtc 

agtggtctgcagactggctgagagacactgttgagcaacacttctcaaacttcaatgtgcacctgaatcacctgggaatc 

ttgttaaaataggccagacgcggtggcttacgcctgtaatcccagcactttgggaggctgagacgggcggatcacctgag 

gccaggagttcgagaccagcctggccaatatggtgaaaccccgactctactaaaaatacaaaaattagccaggcattttt 

aggtggcattttaggcaaattaggtggcgcatgcctgtggccccagctacttgggaggctgaggcaggataatcacttga 

acccaggagatggagattgcagtgagctgagaactgagatcacgccactgcactccagcctgggggacagagtgagactc 

tgcctcaaaaataaataaataaataaataaataaataaataaataaataaatatgaaatacatgttctgattcagcaggt 

ctgggctggggcctgaggttctgcatttctaacaagcttccaggtgataccaatgctgctgatcctcagcccacactttg 

agtaacaaagctgttgatggtttgaaatatcctgcctatagcctggccccagaacttcatagacaggatcgagcccctcc 

catcttagaaaataaatattcttcctcacctcccttcagctttagcgccctccaactactgccctcacatgtattcctga 

tttcagccaagcttttcagaaattctagctcaatatgattgcgtcactacactccagcctgggcaacagagtgaaactcc 

atcttaagaaaagaagttctagctcatgtcacctcttgctcactcctcacctaagcttacctcccttgggccactaaaac 

agtgtttccagtctcatctggggaccttttacagggatctagctttacccatctcttggtacaatcttcatgagaccatg 

aggcatctcagtctcctgggccggtatatattcctctgttcagcctttaaatgtaggttctcctcgtgacagagtgacac 

cctatctcaaaacaaacaaacaacgaaggccaggcacggtggctcatgcctgtaatcccagcactttgagaggccgaggt 

gggcggatcacctgaggtcaggagttcgagaacagcctggccaacatggcgaaaccccgtctctactaaaaatacaaaaa 

ttagctgggcatggaggcactcacctgtaatcccagctactgaggaggctgaggtaggagaatggcttgaacctgggagg 

cagaggttgcagtgagccaagatcgtgccactgcattccagcctgggcaacagagtgagaccctgtctcagaaaaaagaa 

aagacaaattgagtcatgcccccaccccaccctgccacctcaaacctttcaagatttgctgttgcccttcatataaaatc 

aaatctgaaatagggtgcacaggccctgcctgcagggtaagcccagcccagccctctgtccagacttaccatgtcctctt 

cccaccccgcattatccaactttgaccttcctagcactcagcaacgcttgtcatttcaaaatcacttgcctgattatctc 

ctccatcagaccgttggtttctagggcagggacagtgtctctctgggtcactatctcccagcacctggcacagagcaggc 

accagaaaacatttagaaggatcctgattccaagaaagtaaagccattttttgacacaagctgggaactgtgggacttta 

gatgagatttaaggaattactgttaatttcattagacatgagaatggtgttgcgatgacattgtttaaaacgtctttatc 

tgttggagagatgtattaaagagttcacgggtgaaacaacatgatggctgggatttgcttcaaaggactccagccaaaag 

taaacaaataaaataaataagcattcagatgagggaacaaagggacaaataaatcccactctgcttgtctgttcctgttg 

acagctacagggcctgcaaggatgttggctgtcccggagatgggcctgcaggggctgtacatcggtaagaacccccacca 

ttctgccctgacccatcacccactcacccgagcaggcactgagacccacctacaggttgtgcaatgtacctctcccacag 

agcttcccaacagctcttctgtgagtgagtgtgtgtgtgtgtgtgcatgtgtgcagggcaggtctgtgtttactggaccc 

ccctgtgtgtgcagaaagggtctgtgtgcaccggtctgtgtatgtgagtatgtgtgcacacaggactggtctattcacag 

gaccctcatgtgtatatgagtgtgggctgagtgcgtgtgtttgtgtgtgccagaagggtctgtgcacacaggagtgtatg 

catgtgcgtgtgttttgttcaggctctgtccacaggaccctgtgtgagtgtgtcagtgagtgtgtgtctctgcgggcagg 

tctctgtgcacgggatctgtgtacgggagtatagtctgcaggtgggcatgcatacagagagttctgcgtgtgcatggagg 

gtctgtgtgcacaggagtgtttgaatgtccatgtgttttgttcaggccatgtgcacaggaccccgtgtgagtgcatcagt 

gagtgtgtgtctctgcgggcaggtctctgtgcacaggatctgtgtacgggagtatagtctgcaggtgggcacgcatgcag 

agagttctgcgtgtgcatgtgcgtctggccatagcctggtctgggagcaggcttgggttctccgcaggggtgggaggccg 

gaggcaagagggacactcctcaggaagtccattcacaaagacctggtgggctgggcccttccttcctgcctcccttccca 

agctcgagccacagcaacttggcctctcgggctgcagctggtggcagggctggagcctgccctggctgcctgaggagagg 

gagcccttgaaatgtggccactttttccctgcccctccagcacctcagactctggccctgtccaggcgcttcccagagtc 

cccctcatgcctggcccttcgcctgtgttcccaggttacccccaaggccagggagagtggagacctgcctggcagcacgt 

gacaggactgtggctgagggagcacagggcctgcctggctcagcctaaggcgagccagacctgcctgatttgtccatgga 

gcttcctaccagatctcactctccctccctagccagcctccctggcctgtccatggat'atcaaggctgccccagtactcc 

tcacatgccccttagcaatgcccagtgtggtggtggcagctaccagccaggcaggccctcGtgccccataagccgcctga 

tcctcggggcgaggcctggggctgtcctccagggctactctactccctcccaccaaaggtccagctcagtcccaggcgtg 
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caagagggtcttgatggggcgtcctgccctgaaatgctggctcagggtgtgtgtatgtgtgtgtgtggggggtgggaatc 

agacttcctaatgttgtggacacccccagttcaggactgggctatctctatgggtgtgggaggaggtgctccccctcctc 

gatcctcctcaagcagccagctgggcctgtctgtggcccttgtgggcatgggagctgcagagttccctcaagtcctgatt 

agaaccacgtggcccaactttagggtggaggggagcaaccccacctcctagccagttagctctgtagctccaccttttga 

aggaggagggcttctctctcctctactccctcccaggagctcacctgccctatctcccctgcccttccatctctggactg 

agtctccctgccacacaatgcttttctgtctctctcccctcccacactgtgtgtctgaggggccctagtggggagacagt 

gagctcctggggaaagcatgggtcatctgctagggaaacgttgcctctgacagcagaccctcagctcctgcccctgcgtc 

atgtgtgtctttcctacaccccgcagtcctcctccccagcagagagccctttgccttagcccagcaaggactgatggaga 

acctttgggtgcctggcttggggccccgctgccatacctaccctgcttgtgccaggatgaactgccgtctcttctctgca 

ggctccagcccggagcggtccccagtgcctagcccacccggctccccgaggacccaggaaagctgcggcattgcccccct 

cacaccctcgcagtctccagtaagcccagagcagggaccaggtggtgggtgacctggctggtgtggacagggtcgtgcgt 

ggcaaagtcatgacagggcctcagctaggaaggaggaggggatgggggtagcaccatgcctcttgtccccgtacactcca 

gtcatgtgccccccagaaggattggccttgggtgaccctggactataaagggtaacaattctactgagaaggcgagcccc 

atgtaactaactcccaacccccccacacaggaggcacccacggccattgaagctggcctggctgaggcaagtgttcccct 

gagcatgtggcttttctaggtccccagagcactgggcctcttgtggaccacagcccaccttaatccaagtcatatctgaa 

ggagaggagatgtgttgccaaaaaaaaaaaaaaatagtctggaaaccattagcttgggctgggcatggtggctcatacct 

gtaatcccagcactttgggaggccgaggtgggcagattgctttgagctcaggagttcaagaacagcctgagcaacacagc 

gaaaccccatctctacaaaaaatacaacaattagctgggcgtggtggtgcacacctgtaatcccagctactcgggaggct 

gagggtggagaattgcttgaacctgggaagcggaggttgcagttagccaagatcgcaccactgcatttcagcctaggtga 

cagagtgagaccctgtctcaaataaaaaggaaactactagcttgaagtatccaggacaccacctccccacccccaagctc 

cctgggtcaccccacctggatgaccttgccctggtgatccagcttagggacccccttattcttaggggagtccagccctg 

gcataggagaaggcacacacttctctggggacatgcacgtgccctctcctctctgccacagaatcagacagtctggtttg 

tgtcactatggccacaaagagcaagaggataaaatatatacactggtccatgtgctgtaaaactgcctggaggagcaaaa 

ttgctcccagctgggcccagactaatggctgcctatgaagatggtgtgaggtgggggtggggctaaggcagtccagaggg 

agggtgggcagaggctggaagcaggaaagcaggagcttgagccaaaccctgcctggggctacctgagagacacgtccaag 

gctcagcctggagcctgggagggcaagggagcccgagcaggtgggcaggtggggtgggccaggtccagggctggtctgga 

ccacagtcagtggaggggagtgtcttccccatgcagaggaagcattggggctgtgggggatgggggtgtccctgctggct 

gcttgctaattctaagtcttgcacttgcagaaacccgaggtccgagccccccagcaggcctccttctctgtggtggtggc 

cattgacttcggcaccacgtctagtggctatgctttcagctttgccagtgaccctgaggccatccacatgatgaggtgag 

gtcggctgggctgagagagtgaggtggggagggtggggagttcctcataccttggtcccaaaagtactgtcaccgagaca 

tggggtctccttggaggccctgccacccccagtctggggctccccactggggtaaaagttggaggatgggccccaggcct 

gggtccttggcctcaactggagcagctcccaaccctctgtaggaaaagtcagactttgtttttaattttgaatttttttg 

agatataacaaacataacaataaaatatgcaaatgttttttattttcctttttttttttttgaggcggagtctcactctg 

tcgcctaggctggagtgcaatggcacgatcttagctcattgcaacctccacctctcgggttcaagcgattctcctgcctc 

agcctcccaagtagctgggaatacaggtgtgcgccactatgcctggctaacttttgtggggttttttgtttttttttgag 

acggagtctcactctgttgtccaggctggagtgcagtgacgcaatctcagctcactgcaagctctgcctcctgggttcat 

gccattctcctgcctcagcctcttgagtagctgggactacaggcgaccgccaccacacccagctaatttttttttttttt 

tgtatttttagtagagacggggtttcaccgtgttagccaggctggtctcaaattcctgacttcgggtgatccgccagcct 

cggcctcccagagtgctaggattacagatgtgagccaccacacctggcctttttttttttttctttgagacagggtctcg 

ctctgtcacacaagctggagtgcagtggtgtgatcatagctcactgcagcctcaaactcctgggctcaagcaatcctccc 

acttcagcctcctgagtagctaggactacagacagcaccaccacacctggctaatttaaaaaaaaaaaaaattttttttt 

ggagagacagggtctcactgtgttgccagggctggtcttgaattcctggcctcaagtgattatcccactttagcatccca 

aaaatgctggaattgcaggcgtgagccaccatgcccagcccacaaatatttttctaagcttgtagacaaacacacacata 

tacaaacatatatgtgtttttttaagtaaaagtgagatatacaacttgctttgttttaaagcttcaacactattttgtag 

atatttcatgtcagcacatacgaagctaccactttctttgaacggcctagtattccatagtacagatgtgtcataattta 

tccattcccctattgacagttttaggggggttttttcctcctctcatttttcactattacaaaaagtgcagcagtaatat 

ccttctttgaaaacttgtgccgcaatctccatggggtagattcctagaggtgattcctaggtccaacatctcaaatcttt 

tttcttttccttttttagtaaaaaggaaaagaaaaaataataaaacaagaaataaaaataaaaacaagaaaatgaaggtt 

ctaagggctgagttgacaagtcacacagttgttttgaaagacatgtttcaaaaatccattcaaatgttgggttttatgtg 

ggacctcaaggaccttgggacctcaagaccccagtcctctgaatatgtcctagagggtcggcaggcagtaggtgaagcta 

caggagacctcaaagctctcaacccagactgggatgggaggtaagaggggcttcccaaggcctggagactggatggaagg 

gtgaatggagacatagacagacaggtaactgctcccaaaatacatgggacttgagcttcatgagactggactgtccttcc 

tfcccacatcacaggaaaccacaaaggctctgagcacttacagccctctaggagctgcagggcatcccagatgtcttgctg 
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atcaacacacatggctgaggccatgctggggctaaggacacatccaggaacactgtctctgtcctgtggaggtgacagtc 
acaggaaataactgggggacattccagcctcctcccaagctctctcttcccaccagacaccaagcccctaacatctctag 
aatacccatctacttctctccatggccactgatcgtgcccaagttcagccccgaccttctctcacctggatggctgcaat 
atcctcctccttggtttcctttctccatgctcccttccctgcattctgctgctccatgggagccagtgacctttctttct 
aaaggcaaatgtggtcatacctcttggctactttaaatccacgaagacagaattctgaatggccttctgtgctcttcatg 
atctcaccttagcattcttgtgttttctcctgctgcctatggagccgacccatagctagttctccctgctctttcacatc 
tgtaagtttttgcacatgctgttccctctgcctggattcccacccaagaagggagggttgcactgactgccctgtgcctc 
cgcaggaaatgggagggcggagacccgggcgtggcccaccagaagaccccgacctgcctgctgctgactccggagggcgc 
cttccacagctttggctacaccgcccgcgattactaccatgacctggaccccgaagaggcgcgggactggctctacttcg 
agaagttcaagatgaagatccacagcgccacggtgagtcacagggct ccagacagggaggcggggccagcatggaaaagg 
gcagggctaatgggggtgggtgggacaaaaccaaaacgtgtgaggaccggcccgatggagtcgtggctgagagggggcgg 
ggctaaagggagacgtcggactccggtgtgggcggagctcagaaatgaggtggaggcggggctaatgtgggtggggctaa 
tagtgaagctggggttgcaggaggggtggggctaaggagaggggtcggggcagagctaatgtcacatggggcaagagtgg 
gacggtggtaaagaggaggggaagctccaggaaacggggtgattttaagagcgaggtcgtcaggaaatgagtgccaagct 
gaggcctcctgcagagccgccctgtgtccctgccaggatctcaccttgaagacccagctagaggcagtaaatggaaagac 
gatgcccgccctggaggtgttcgcccatgccctgcgcttcttcagggagcacgcccttcaggtgcgctgcggccccacct 
ctgccgactgtggcagggaccccctattttcccctcatccgaaaccgctcccccatcccgtccccgacattggatgggta 
gccaccgccggagctcagaggtcatcttctccagtaccctcctccctttttgtctggtagagcctgcaccaagccatact 
gatgggaggggggccgattcttccagctctgctgggaagtccttcctgtgatttgattagtacctccagttccgcagagg 
gctgaagaccaccctccctccaagccagctttcctctcactgccccctcctgtaccaggagctgagggagcagagcccat 
cgctgccagagaaggacactgtgcgctgggtgttgacggtgcctgccatctggaaacagccagccaagcagttcatgcgg" 
gaggctgcctacctggtgaggacgtgcaggcgggcccgagaacactgctcaggaagggccaggcctgtccccatgcttgc 
atgcaccccaccacccttgagaccacagagtcattgtggaaagaacttcagcctgctcctgatgggagtttgtagagttg 
ctcccaccagaagagggagtgggcctgcaggaacaggggacagagggacaaaagacacagcccaggccagtgtagacagt 
gccttaagtcaggtgtcctaaaagcagagcccaagatggagagtcttttttttttttttttttttttttgagacagtctt 
gctctgtcgcccaggctggagcgcagtggcgtgatctcggctcactgcaagctctgcctcccaggttcacgccattctcc 
tgcctcagcctcccgagtagctgggactacaggtgcccgccaccacgcccggctaattttttgtatttttagtagagacg 
gggtttcatcgtgttagccaggatggtttcaatctcctgaccttgtgatccgcccgcctcggcctcccaaagtgctggga 
ttacaggcgtgagccaccgcgcccggccaagatggagagtcttatatgtgaggtctattgaagaaggtttctcaggagaa 
aggcgagggagggaggcgggacaggagaaggagctatgaaggatgtggtctttgctggagtctagcctcaggtttagcct 
cagctagatcccatggggagctgcagagggagaactgtagcacagagttggggccggcttcttgcacatccgtatcggtc 
tgtcactggttctggggaaatgggagggaaacctctccaagtgaggccattcccattcagctgagggcagttatccagag 
gaggtagcagctgtgagccaatagcagccaacactcacagtggcaggcagggcacccagaacattcacttcacttggcat 
cagtattttgagggacacccccaccaccgcccacctttgttggttcttgagcaggggaaataggactatgaaataaaggg 
aggcga 1 1 c t agcc aatgcat acagg t c 1 1 cct accac t gagatgc t caaaggt c t agagcaaggagt tggggggggc t c 
ccgtggagacagagagggcatagcctcaagactcattgcttggtcgtaaccctagagccttggatacagaccctaagctg 
tccaaaagggatctcaagcctgcctgtggacaccctggagggacccccaatcctcccatacctatgggccaggcatgacc 
cagggccaggtctgtaaatgtctacacttttcactggaaggagctaaccctgagcacagtggacatccctgtgcaaagat 
caggatggaagcaaggctgtccccagccagaacaaacacctgctcccccagcgtccccttgcctattttcccaccctcct 
tctcacgctgttccctggctgctcaggaaagtctggccctgggtaaaagtctccattcacacccttatccctctccccta 
tctggtcagaatctggggaaccctacaaaaccataatatacttcccacagggatcccctaccctgaaggaaataactcca 
gatctcaagtgtttttctccacccaactggcctgtcttggtgcctggaatttcagcttcctccctgctgtatcagattcc 
ctgggaacaaagttctcctggaagtggggtggcctatggctaccgtctgacaccagcctagtggagaaagagggttcaat 
gagctaaaagggctccccaccacctcctctgagccatcacacacccccacattgtgctagagtctctgccaataccccca 
gccagcgctcagctggccgcttgtacctgtgaaggggaaccttgctgcagccccgctatcttgggaaatttgaaggggca 
gatcccaggggttctaggtctgcattctgtctagagtcttttcctgctgggcaacttcttggcatatttggccagggcca 
ttccctccccagcctggtcacagcgtggctatgaggcagctccaaatttgtgcagcacagaggggcctgagaggcctaac 
atgggtggggtgtgatggaggaggaggtgggagccccacaggccgcggttccccacctcacagtgccatcttaggtgtga 
cagcccacagtgctgcctgaccctgcccaccacccatccccaggctggactagtgtcccgagagaatgcagagcagctac 
tcatcgccctggagcccgaggccgcctcggtatactgccgcaagctgcgcctgcaccagctcctggacctgagtggccgg 
gccccaggtggtgggcgcctgggtgagcgccgctccatcgactccagcttccgtcagggtgagctgcccccggggacacc 
acccacccctggagggtcagagggtcactgaagccagaagctcagccatgtctagtatgaaggggagagggtacccaccc 
tggaggagcccaatttgagcaggcagaaacatggctggtggagcctgtctgaggaagggaggcacagccctgcccaaggg 
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caacctcgtctcagggtgggacatgccctacctagggcagcccaatctcagggtggagatttcaccttgccctaggggag 

caaagtctgaggagggcaggactccgcccagccctgagtctgagggttgggggagacacagactaccccaggaggaaact 

gctggggtactgacaaaggaggaagtcagcccactgcatcagctgtccccttcactcccctcatcttccccgacacacat 

cagccccaagctcctgctgccaggaccaggcacccaggccaaggacagctatggtcatcctcccaaatgccatctcccag 

ggcagggagaagccctaagtcctgagtcccctctgagactccaaagacctacctgcctccgctcctccaaacccctctaa 

ccctgattttgccatgacctgagacctcctgcgttaaaggaaggccctgtgtccataaatatttccccacagctgttgga 

tacagggtgggagtttggggttcaggattgccctctcccagtcaggagcaggttggagtttcaggagcactggctgctcc 

cagtgcccatggaggtcctgggcaggaggatgggagttgaacgccatagctggagcacctecttctaatctcactccctg 

ctgtctcctgacccccagctcgggagcagctgcgaaggtcccgccacagccgcacgttcctggtggagtcaggcgtagga 

gagctgtgggcagagatgcaagcaggtagggggaaagggggacggagtgttatccttggcccctaccgggcaccatatac 

tgatggggggaagggcatgtttgcaaagcccgtctcttcctcctccattcgctgtacccaacctggccgtcccctcacag 

tcacccgcacccccaccccactcacagcggcgcccctaactcccactcctccaggggattctccgcggacgctcgggtgg 

agttgcagagcctctggaaccatttctcccccacaccctgcgcccatatgtggtggtctgaggttcaagcacctgaagcc 

cctcacgtccctcccccgaccctgcagacaggccttgggacccggggcagggctggaggctgggcgaggctggagggggc 

gcagggctgagggtgcgaggccgcccacgagtgtgtgcccgcgctcgccgccgcaggagaccgctacgtggtggccgact 

gcggcggaggcaccgtggacctgacggtgcaccagctggagcagccccatggcaccctcaaggagctctacaaggcatct 

ggtgagtagccaggcggcgccccggtacccagcgcgacccgggctccggccccgccactgccccctggcggcccggcgag 

cgctgacgccctcttcgccccctgctccaccccagggggcccttatggcgcggtgggcgtggacctggccttcgagcagc 

tgctgtgccgcatcttcggcgaggacttcatcgccaccttcaaaaggcaacggccggcagcctgggtagatctgaccatc 

gccttcgaggctcgcaagcgcactgctggcccacaccgtgcaggggcgctcaacatctcgctgcccttctccttcattga 

cttctaccgcaagcagcggggccacaacgtggagaccgctctgcgcaggagcaggtgggtcctgagcccgcgggctcagg 

cagggtttgccgacccgggaatgaccgtgcactggagggtcccgggccccaaggaacggtgggggtctgcctgattcatc 

ccacatatacactaagccagcagggcgtcggggtggggcggcggggagcggcgagtgagtgccccagcccagcaggctcc 

acccacggaatccgcagcccgaactggggcaagacagagaatcatagcggggaggcggcaatgcctatctcctcccagcc 

ttctctacacccccaccccgggccctgcgggcccatgctcctcggtttccctgcaccaaagcaaggggaggcccctccca 

ggacctcgtacctggaacctggagcaggctggcaactaaatcctctgagtgagtagggtggagataagggactaacatcc 

cgcaggtccagtcctccagacaccacgtgcagtcggtgcccaggcacttctigcctggaggcagaggtagagaataaggac 

cacggaccccaaactggggcaagcagctgggccctgaccgatggatatttgcccctttcaccaccaacagcgtgaacttc 

gtgaagtggtcctcacaggggatgctccgaatgtcttgtgaagccatgaacgagctctttcagcccaccgtcagcgggat 

catccagcacataggtgagcacctgagcttggtcccccacccgcccctacatgaacaaacagatgcagaataattccccc 

ctatcagtgcctagatacctccacacatccatacactgtgatgagacctagaatcatctagaacacctgcgggatgaagt 

gcagtggtgattaagagctagagggttgatatgtagtcttgccaaggcaaaaaacttctggtgcctcagtttccccattt 

ataaaatggggtgatagtattgggttctcaaaacgttatcacagggataaaatgagctgaagtacctagagtgagcacaa 

tgtcttgcacacaatgtctaggtgtttaatacgtgtaaaatgcatatccttatctGtGgtcctccacgtcgtggtgggag 

agaagtggggagcgtgagtgttggggaggcgaagccctcgaggactcccgtgagctctcaaagaaagtgctcaaatggct 

actttctagtcgccaggtaggtacaggctagggaggggaggcgccggtggccgcctagtggtggcctcagtggctctctc 

tcccccgccccttctcctctgcccccttcacccgcgtccccccgtcctgtcccgcagaggccctgctggcacggccggag 

gtgcagggtgtgaagctgctgttcctagtgggcggcttcgccgagtcagcggtgctgcagcacgcggtgcaggcggcgct 

gggcgcccgcggtctgcgtgtcgtggtcccgcacgacgtgggcctcaccatcctcaaaggcgcggtgctgttcggccagg 

cgccgggcgtggtgcgggtccgccgctcgccgctcacctatggcgtgggcgtgctcaaccgctttgtgcctgggcgccac 

ccgcccgaaaagctgctggttcgcgacggccgccgctggtgcaccgacgtcttcgagcgcttcgtggccgccgagcagtc 

ggtggccctgggcgaggaggtgcggcgcagctactgcccggcgcgtcccggccagcggcgcgtactcatcaacctgtact 

gctgcgcggcagaggatgcgcgcttcatcaccgaccccggcgtgcgcaaatgtggcgcgctcagcctcgagcttgagccc 

gccgactgcggccaggacaccgccggcgcgcctcccggccgccgcgagatccgcgccgccatgcagtttggcgacaccga 

aattaaggtcaccgccgtcgacgtcagcaccaatcgctccgtgcgcgcgtccatcgactttctttccaactgagggcgcg 

ccggcgcggtgccagcgccgtctgcccggccccgccctctttcggttcaggggcctgcggagcgggttggggcgggggaa 

acgatagttctgcagtctgcgcctttccacgccctccagccccgggggagataaggtcatgggagagtgggtggggacac 

acccagagactggctttgggattgggcactggtccgctgactgccaggctgaagggacccgccaaggactgaacgggtaa 

gagaagaggtttgcaagacagagcgcgcagcccggcaaggggcatgtgaccccgaaggaagaacgcaacagaagagtcct 

ggtctgaacttggccgagtaggggtgggggtgggatggcaggaggagccgcaggaggaaggaggttgtgcagggtctgga 

cctgcagggctgaagttcactcatcgaccgactcagccccaaccgggagccaggcagaaaaaccctgtgccgtaggaaag 

tgactggaagtggactccagagggacaggtgtggtggcacagtcctggtgtggtgctgaccacccaaatatgactgtgaa 

ttgtggaaagggcagtagatctctaatgtggaggtgggaacattattgtggtggaggcaattatgagggtagcatttctt 
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tcgagacaaaacacccgtctgggaaggccccaaggtcagcttatgaaggaccccacttgcaccccaccccagccatggaa 
gagcagctggagggtggatggggaggccagagggagcaatgaggggtggtcccagctctgctattgactcggtatgcctt 
taggacattctcttaccgctcatgggcctcagtttcctaaagtgtgaaatgtcaggcacttccctctaactggcatgcaa 
cagccccacctgcctgagagccctgaggtgacaataaaacatttatgctcaaggggaagccacagcctgctgatatggcg 
tggagaccctaatagtgggaggaatgcaagggttcccggtgctagagagagaagggagaaagctttcagctgtgcatagg 
gaactgaccagaagggggtgctgctgtctcccatcaagcatcccaaacaactccactgcttaagacctctctggcctaca 
catgaggtccctctcbcctcattcaaattaattgtcttggaagccagcttctggcctaaaatgccaccacctgtgcatac 
ctcttgtggggctaggtgctataataccacgcggtgcccctgcctcctgagtgagtctacccaagtctttccctggccca 
tctgcaaaggagtaggcattaccccaaccccagagaacaaaaatccacctggcctccggtatccactggaagtttatttc 
tttagggttctatcccaaccagtcgcttaaaaaccaagtaacacagacctgaggggtgggggctggggactgcacctccc 
tcctactcatggtggacagcagtggggactagggaggggcaggagaggtggctgaagcaaggcagcagtaatggggccac 
gacgccacagagccagctccgtcctctcccagaccctggtgggagtccctgtggcttggggtggggagtgggggacccac 
cccaggccctccctctcccttcctcagacagcctcctttcgggctcaacccatttcttccggcaggagactgaggcacac 
agagaggaggaagtgggagaggaggacgagggaggggcagggtggcagcacaaatgaaggcagaggtgagaggcgtgggc 
aaggccactccacccccacacccaccccagagaggggcgaggaagccacaccatcacgcagcatgtcggggggacaaggc 
ggggtttaaggctgaggggcccggggcaggcggggcctcgggcctcagtcaaagccgtgccagtcgctg'tgctctgagtc 
gtattccagctcggcgcccacacacttgacaccatccagcagcatgggcgtgccgtggtgccggtctgcaaggcagggtg 
caagtcagtgccatgctggcccccggccccacccatgcggcccactaaggggacccctccccttccctcagggatcagct 
ggaggtagggacctgccaaggaggttgagaacccctgagccgggcaaggatcccttgttcagccttggttccctgaggag 
gacagaaaccctcgcagtcgagcttgtgcatccctcctccaaccaggagcctcacgctctcacccatgacgcgggcctgc 
accgtcacgcgcacacaggtgccagtgccaccttcgcaggctagcagtatctcctctttgaggacgccctctttgtgcgc 
cgagtactcacacttgacactataacctgcccggggacaggggcaattggtcagcacctcccccagcctccccgatcctg 
cccctggcactcacgggcccctacccaccaccatggggggactgcatgctaagcccccccaagaaaggggcagggatggc 
ccttctggagcccagagacccattccccctagcaggggtgcacaggtctcaaaaacccattcttcagtgagctggacatg 
cctccagcctgtgggccaccccaatgtggctccaagagtgaatgaagtagggCccaattcaggcttccaaaagaaggtct 
ggctgttttctcccaaaaggaaggcagggagaggcggtgatgaggagtgaggggggcagggcagggtaggctttgagcag 
atccgatggcaagaggtaaaggcctgaggggtgtcaatccattgaagggcaaggtcatgaagaggctggtgacggggaca 
tgaaactgaagctggagctccacgaaactgacaccctagcctctgccaggctgggagtggggcatggggcagggcctgaa 
gtgagcctgataggaacagcagcctgaaacccaaggtctgggactggtgggtgctaggagttatccacacggtgtgtgta 
gctccagcaggtttttctagagggttagggaggcaggagggaaggctggaggcttcaaaccagttcctcagcagctccca 
tcttggttactgccccacggaggtaaccatcacaccatgggcaggtacagggaagtatgagctcatggacttctgttcag 
gacagggagcaaggcctggagtgtgggacctgccatgctgccacagtgcaagctcacaaagaggtgccacatccccgact 
tgctgagctgcccatccaccctagttggaagtaagggagcaccccatgttctcactcccacacccatccaggccctgctg 
gaggaggggacgcaccttcagggacgggcaccacgctgaggagcttgaggtgcaggctggggacaggtgcctcgcggaca 
tccttgctcagcctgtgcactgggggcagagtgaaggtaatctcatacctgtgcaggatcttcaggaagccaacctacag 
caggagaggtgagggagggaggcaccttcacttcctgcttcccacaggcccacccagttgtgccacctatgatcccatgc 
gccccagcacctgcttccccagagagcccagaaagcccaactccctccatcactccagctgatcgacccccagccctttc 
acagaccccttgagggtcccagccctacagcttggccagcaagatcctcagcatctgtctcacgggccccaaagtcctca 
gtggctcatttcatagacaggaaaatggaaaccacaaaatgactggccaaaggctatgtagtgtggccagggtcaagtcc 
tccctagacttcttggccattcctgctatgcatggcacccagcaaggcaccacccactgcctacaagggaaagttcaaag 
tcccatgggtcattcacctgtccccatcactgccagtatcccagggtcagggatgctggtcctccattcgctcataggat 
gacaccaggccacagatagcgtgggatgagggatgggatcaaattagagatatgacagtagaccagctgtgtcagcccag 
agctggcccacagacctggcaccagccactgaactggggctcctggctgccctgctggcactggactggggatgcctgcc 
tgcccactcaaaggctgtgtccaatggcccat act cacgacccaagcaatttctgttgt teat tcacagtgcttggtgac 
aaccaccctctgggacaaatgccattcttggaaactccagtagtatgaagggtcacaagccagggtggttgctgagcagg 
ggggctggtgggggtgccacgccagcaggcaaagggtgcctgctgtactcccagcttgcatgggcacacagagtcctgct 
cacagttactggggctgggcagccccatccctgggggccaactgggactggctgcagagagttttagccattcaatggga 
ccaggttgatattgctacatgacaaacttcaatccatgtgcccccctcatgctctgctggtagggggcacatccctcaag 
aaggctttctggctgtacctgctaactgtgaccctgtgacccaggggtactacttatagaattttccaggggaaatagag 
atgtgcagaaaaatactcttcgctgccctatttataacaatgaaaacaaaaacagccgtaacacccaggaacagggagcc 
agctaagtcaatgatgacacatatcatggaccagtaggcagttgtaaaatctcgtggaaaattatttactgacacaggta 
ggagacagttcacaacagaagcaagcaaaaaccagtgtggacacggtgaggccagcttttgtttttggaaacaaagcttt 
ttaaaataagctggatccatcaccatagtgttttcagaaaaacaaataaataaataaataacattttaaaaagctggaat 
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aagccaggtgtggtggctcacgcctgtaatcccagcactttgggaggccgacacggacggatcacgaggtcaggagatcg 

agaccatcctggccaacagagtgaaaccccgtctctactaaaaatataaaaaattagctgagagtggtggcacgcgcctg 

taatcccagctactctggaggctgaggcaggagaattgcttgaacctgggagctggaggttgcagtgagctgagatggcg 

ccactqtactccagcctggtgacagcaagactccgtctcaaaaagaaaaaaaaaagctgaaataacacacatcagaatat 

taaccatggttattttgggatagtatgactgaccgtacgctgaagaggatgcgtattgactgtgctttccttctttgctg 

gtggatatgaaaactggaatggacattgctatgacgggatggagctttcttacaaagctgagcatgagtcttaccataca 

atctagcaatttcactcctagatgactacccaagtgaagtgaaaacttatatccagggccgagcacagtggctcacgcct 

qtaatcccagcactttgggaggccgacacggacggatcacgaggtcaggagatcgagaccatcctggccaacagagtgaa 

accccgtctctactaaaaatataaaaaattagctgagagtggtggcacgcgcctgtaatcccagctactctggaggctga 

Qgcaggagaattgcttgaacctgggagctggaggttgcagtgagctgagatggcgccactgtactccagcctggtgacag 

caagactccgtctcaaaaagaaaaaaaaaagctgaaataacacacatcagaatattaaccatggttattttgggatagta 

tqactgtgggtacttttaaatttcttcatattttctctgtcttccaaattttctcatgtctattaagacatgagtggatt 

ttgcaatgaggggagagagctatttttaagtgttggtttgttccatttactctcttggggaagctgtcagctgtaggaca 

aagagtgggaacttcagagttggacagaaatggatacaaacctgggccccccaggttcttttttgtttgtttgtttgaga 

cagagtctttctgttgcccaggctggagtgcagtggcacaatctcagctcactgcaacctcagcctcctgggttcaagca 

attcttctgcctcagcctcccaagtagctgggactacaggcgtgcgccaccacgctcagctaacttttgtatttttagta 

gagacgggtttcgacccattggccaggctaaacctgggccccttttgagcagcgcagcagaccccccactcagggccatc 

tcaatgggccagacctcccccgacccaagggcactcctgttcaactctggacccctgatttattgaccaacctgaaggcc 

tcagtctccattttccccaatccagctccaccaagcagaaaacagggggtgataatactacttcaaagagcctaactgag 

ccagagatttggcaaagaagggttaaaaaaaaagttgcaccttgttattgccatagttagcttcacacctgtatcacata 

catacattcttccattctcacaccaactctaggggatgtgattgtctccacttgagagaaaagaaactcaggtgaccttc 

ccaaggtcacagagccagaatggctggcctagacctgaactcaggccactggcaccacagccacgacactgccttccatt 

gctattgtctgggtgaagcagatgacagcagtggctctgctcatcttgatctggatgacattctaaatgctctcatctca 

ttgaaatctcactgcctcccatctgacagatggggcaaccaaggcacagggaaaggcacagacatgcctataagcccggc 

tgagaggaacggggataggcacccagaagccaggcagagccggggaggggaagaagctgctgatggggtaggtgtgcatg 

taccttgaccagaaagctgctgtcactctcctgggtgaccatgaccaccgagtcatgcagcttctcatcaaagtggacgt 

ggccgtgggatccttctgcatcgtggcctgccgcaaagcggatactccggactctgggcttgttgcctaggaagagtggg 

agagggctctgagggctttccctgtccccaggaaactcctcccccttgtcccctcgtcacccccaagactgcccttgaca 

tcatccagctcccactggctgggctcttttcagggctagatggacactaggatcatgaggcttgaggcctccccgccggc 

cccgacctgcccctcccacaaaaatccatcccctgagagtgagcgtggggggactcagggactggctaatatgacccttg 

cttggagggggtggggctggtgccagagccaggaagggacagtcttccagactcccaccaagccagggcagcgtgggact 

cagcccagtcctctggataccctgcccagtgctctccttcacagtgcaaagctccccatccctggggcctacccctaagc 

tgtgtcagtgcatgaggcccctgcctgcccatcctcatgacccaaacctgaaagggcagggacaggaacaggctctgggg 

gatctgcctggggtggcagaaacaggggggatcaaaaaacacactcaggccacctgggcgaggctgcagctgccactaaa 

-ctcactggcccctggcagcaagaagagagaatccaaggaagcttccagacccacctcaagacccccttccttcctctgt 

ctagcccaccatcaaggccctcagctgtctctgagctgggtcgggttctccaaggccagaaagggcagacggatgagaca 

ggcagggtaggcgggagccgagctgaaggcctggactggatgaatgctcaggctgtagaggccgagagcacggccctagt 

cctctctgacccctggcccctaggccctcccaccaagagccctgcccagggtcctcttggcgggaggacctgatccagct 

ttgagccggcctcttggatactcggaggcccaggggaccagcctggcacatccacaatccctggtgagccccacagaacc 

ctcaaatctcaccagatcaccctggcctaaaaccctcccttatcccttctggactctgaaagaaacccaaactcccttcc 

acagcctgctgtggccctcccacatcaccagtctcagtgctcaccataagcccctcggctcccccaacccccacagtgga 

acggcagggacacaagagcacacacctccacagaccagcctaaggagtcagcagggctctgcaggggtcctgctagggct 

tgcgtgagccccagaacctggcacaaacactgacatagtcccaatcacattcgcacaccagcacacacgcactcacacat 

gcacacacacacacaggtggtgcgggccctacctgcagcaatggccctcctaccagggcccaccggaagagcttgtgcaa 

gtcctaccacgccaggaaggcacttacccttgttggctgcagccatggacgccctccctgccacgcagctcctgccagac 

accgccactcacgctcagcagcctcccatgctccagggacaccagcgggagcctgaaggatagggagtggggagggcagg 

ggtcagggccacccatggacccatggctcagggacaccagaggagctccctttaggaaggactcaaacccctaccgctct 

cagcagcccaagtggtgcctgtactctctagcacgggggcttctgcccaccctctaccgactgcccacattcaccaagag 

ccctcaccccttcctgataccccagtgactcagggtcctgccagcatgtgctgagctgcagcttctaggtgccaaagcaa 

cagcataggactcctatccccagcacccccagagactggcaggaggggcagcctggaaaggaggactttattggtatttt 

ccagggaccagttctggtgctgctgaccccaaaggctgggcctggagctaccttattcaggtcacacaagcaatgcagct 

gggtgcggtcagaggcaacagggtgctcgatggtgggcaagaacaggggaccacataaagtaagcgtgcccttagagctt 

tccctcctggtgatcggtcagggccatatgcaaaacagatgcctgtggagggggtgtgcccccacctgaaccccattcag 
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accctgccctgagtctcaggcctgcctccgcctcagctccccaacggcagcctcagcacaggggcagtgagaggcacccc 

aggaagcctccaatgggccgagctgggaccatctgagcatcaaaaagaataatgagtgcaactgatggaaacagatggaa 

cacataaaagcagtgcattcacagagattattagaaaagaagagagaaagcaaaacaaacaaacccaagccctcaaaagc 

cctcatttgtcaccactgacggtagcagtgccctctttactctgaaagctgaggattaaagaggaagaattcagcctgtc 

tcttggcctttcttgtaaccaaattgccctggtggttgatggaaagctcttctttcgggaagaattcttgctaataaata 

acaaagaaatgaccaaatgctaagtcattctgcaacccctaaaggaacaaatgctggaggcaacaagaaccagtggatgc 

taaaccagaggggaaggttgacaggaagcagatactcacaggcgcccaagcgtcactgacagaggacctggtatggtttt 

tgttttttttttttttctgagacggagtttcactcttgtcacccaggctgggatgcaatggcacgatctcggctaactgc 

aacatctgcctcctgggttcgagcgactctcccgcctcagcatcccgagtagctgggactacaggcgcccgccaccacgc 

ctagctaatttttgtatttttagtagagacagggtttcaccatgttggccaggctggtctcgaactcctgacctcaagca 

atctgctcacttcggcctcccaaagtgttgggattacaggtgtgagccaccatgcctggccgaggacctggtgtttaaga 

ggaatggcacaagtttgcaatggaggtatgtgacttctcccagtcacacactggtctatctctctactctctgcaggaca 

gcctgtcacactgatgcctcctgaagtaaggcagcccaaagtcacagcatcactgacaagggattcctgccaaaaaggtt 

taacctggaatctaaaccagcctctaacttcaacttctcatttgctaaaaacacagaggagaggggaacaaatttaatga 

ctccatgagaaagcaatgagacgagcccagaaggcaggatattctacaggatgactgacctgttttttgtttttgttttt 

gttttcaaatgtcaagaagaaaaagaaagcaggctgggaacggtggctcacgcctgtaatcccagcactttcacggatca 

caaggtaaggagttcaagaccagcctggccaacatagtgaaacccagtctctaccaaaagtacaaagattagccgggtgt 

ggtggcgggcagctgtagtcccagctagttgggaggttgaggcaggagaatcacttgaatccagaaggcggaggttgcag 

tgagccaagatcgcgccacggcacttcagcctgggtgacacagcgagactctgtctcaagaaaaaaaaaaaagaaagaag 

aaaaataaaacgagggagattattctagataggaaagcagcaaactacagctgattacacagctgatggtaaggttgctg 

cctgttgtgtttttgtttttgtttttgagacagggtctcactctgtccagcccaggctggagtccagctcactgccaccg 

taatctcccaggctcaagcaatcctcccaccccagcctgccaagtagctgagaccacagccacgcaccaccatgcccggc 

taatttttgtagagacagagttgcccaggctggtctcaaactccggagcttaagcgatccacccacctcagcctcccaaa 

gtgctgggattacagacgtgagccgccaggcctggcaaataaatttttgcaaataaagttttactaatagaacactacac 

ttcattcacttacatgttgtctgtttttgcactaaaacaaagttacttccaagtagttatgatagagaccatatggcctg 

caaagtctaaaccatttattatttggcccttcagagcaaaagtttgccaacccttgttctagatcaacacacacttaaca 

gaccaaaaaaggataatcttcaggacaactgacctgtttttttcaccacatcagctgcaagaagaaaaataaaggagagg 

gtgtttcgaactacctatcaaagaagacattttggggggtgccgggcgctgcggctcacacctgtaatcccagcactttg 

ggaagccaaggcgggtagatcacctgaggtcaggagtttgggaccagcctggccaacatggcgaaaccccatctctacta 

aaaatacaaaaatcagctaggcatggtggtgtgcacctgtaatcccagctactcaggaggctgaggcaggagaatcgctt 

gaacctgggaggcggaggttgcagtgagccaagatcccgccactgcactccagcctgggtgacagagcgagactctgtct 

caaaaaaaaaaaaaaaaaaaaggccgggtgaggtggctcacgcctgtaatcacagcactgtgggaggctgaggcgggcgg 

atcacctgaggtcgggagttcaagaccagcctggccaacatggagaaaccctgtctctactaaaaaatacaaaattagcc 

aggcgtggtggcgcatgcctgtaatccagctactcgggaggctgaggcaggagaatcgcttgaacccaggaggcggaggt 

tgcggtgagccgagattgcgccactgcactctagcctgggcaaaaggagcaaaactccgtctcaaaaaaaaaaaaaaaaa 

agacatttgggggatctggaaaatgtaaatggcctggatatgaggtaacactaagaaatgagttgatcatggaattgtat 

tatgtaaagaattatgtccttattttgtcagggaaacatactgaacaacttagagatgagtgacactggctcatcagtcc 

agggacagaagaggaactacagggtggcctcatagtttagggctgggcctgtggagggcatgatctcaaggtccctctgg 

tgccaagctgctctgggagcagatgtggcctcacctcgctccccacttctcaatgtgggcaaagtccacccaggcccaag 

ccctgcctcatgggcagggtgttcacagtttgggtaccaccaggagccccctttggccaatggaccaggccagcaacccc 

ctcttggaggtgagcttccaggccccagcccagggtgcaggaagtggagctacacccattcacctctggccagggccact 

gtggggtcagttgcctcagccctggaaagctcagcttgtccccagggaacttggtcttggagagcagcctgccgtaaccg 

ccccccgactacagctgcccccaaagctgtgaactcagagtatatgacaaatggccaagcacaagagtgtgaggccctcc 

tatcctgcagaggctgctccagccacacctctgcagccaggaggcagtgacagcccagtgtcctcaaggtccccccatct 

cccaggatgcttcctgaaggctcaggttaaactgcaccagtggctggtgagcaaaggctgcaggcccttttcacaaaaca 

ctctgaactggtccatgcagggcaccctccctgtcttcctcccaacctactgcctgaccgtccccccgccctgcctttgc 

ccaggtcatcccttcagccaggaacaatcccgtccctcccagccacaccttcgacagggtctcagccccaacatttcttt 

tttaaggatgcacaccctgcaaatcccaggacaggactacagctctttttggaatcccctttctcatgtctgggatcatt 

tgatatccctcttcctcaccttacaagggtagtgactgtgtttttgtcaaaccctgtgtccccggcactcagcgcagcac 

cagacagggaggagctgtgtttggtttatgtttgttgaatgaatgaccacatcacatttgcttggagggcctggccaagg 

cccgtacttcagccctaaatgatatggtcagggccagtgactgcactgggggctctgaggaagcagaactccccacccca 

ctcttttgcctcacccctgccctggggggcacttccctctccctggcccccaccaagtggcctcccactgggactcctca 

ggccttgctgcagtcagctggttacctgtccatgcctgctttgagagggacatgccccatgcccactgagggtaggaccc 
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acatctcatgcagtgccagtactcagtaaagggcaaacactgagggcctgtaaccctctggatagtgacaacatagaggc 

aggaagcaagggacttcaggaacccaaaggaaactgggaaaaccaaacctcctttctcaatggagaacctgctggtcgct 

gtcctcggggagctagactcttgtgcacacacaattcctcatcagaaacaggcaaagagggcactgggcctcccctgctg 

cagctgtgtccacagagaacagctaggccaccacacaaccccaagcctggccctgcctccagcctcctcctgtgccaacc 

aaggcctcaccatggctcaatccacatggcccccagaggcagcagtcctggttcaaattcaggatctgcccttgctagct 

gcatgggtgagttctttcacttctctgtccctgtttcctaatgtgtaacacaggaataagcagtggcctcttccttccag 

ggtttgctgcaactgtgcctgaagctctgtgacaatgcaccccccatctctgcaaaagcaaacccccaaaggcctggttt 

ctaaagcaacagcagtttcagcagcaccgtcagagaggcacttcaggccaatcctggaggagccaggagtgacccttaga 

gtgggccccagggacgtcagccttcttggaaaacagctcaaggggtgagggggcctccctcctcctgcctcccccttctc 

ccactcccaaagcagccaggtccctagggagggtcagagaacagatgctgggagtttccagtccccctaaccagaggggg 

tcacaaggaagatgtgcagaatgaacatcctgggaaactgggaaatgactagggaggaacatggtgcctcccccccagca 

aaaaaaattatacccttccccatgagatggagtgtcagcaagcttccaggccccagcccagggtgcaggaagaggagcta 

cacccattcacctctggtcagcatgctcccaacgtctgagacctcatttctcattccttcttcagtccccattctcattg 

tggttcgaggtctttctctctgagccaggaactccgcaaccttccaccctccctacctcttcctctcctaccccagcctg 

gggctcagtcctggctcaagcactcagtccagcagaagcactgtgtagcctcccattaaagctcacgcctgtgaaaagaa 

cacccattgaggccttgagatggggccacactgacccgctgactctcaggactggacacagcagaggccacacatactca 

gaacaaagcctggaaaggcaaggctggaggtcagtagttgtggcagcttcacatcaactcagctttaatgtgatttaatt 

tccttctccctccagtgggccaaaggtgcaaagataagtatggctgttctctctccttctaacagtgaggtgctgggggt 

999g9tgggggaatatggagaagggaccctcaccacccacaccttcctgcctccccaacaagtgctgccctcctctgccc 

agcattctccccactttgccctcagctagtgggtgcttagcctccagatagcatgccccacctaggccctgccctgggcc 

tgtgatccagaggtcccaagaagcagaggccaggGtggatccagggggtcagccaaggtgagggtgggagcacacaggat 

tatctcccagggacagggctgctgcctcgtagctcaggatggatagaatgtggggggatatccagctacattttccctcc 

acaaaagaccagaatgggagggggatggggtgctgccccgactttcttcaactccccggagcagaaaaatgccctacctc 

cactttccagtgccaagattcaagaagaaaggcaagcggagacttccctttctcagtccctgcttactaatggaaacacg 

ggtccagaacctaaatccagtccctcctccttcataccaccgggagggaggtgcagcccaagcccccgaggccccaaggg 

tccaggtgtaggaccctttatcctctccggcagccatccctgtgggtgtggcacccccgccacaccccattcttgtcatc 

tcagggggaggggggaaatgtaatcggacatcccccccatccaatccatcctgagctgcgaggcggcggctctgtccctc 

ggagataatcctgtgcactccccaccttcactcacctcggctgacgcaggagtctccggagcccgcactcccagacatca 

ctgccctccttcctgagggtgctgaggctcggaggctcagagatgctactggtccaaggtcatgcagcgaggcagcggca 

atgaacagggtcgtggggaggagggggcgctgaccccattacgcccccgccctcactaccgcactcatgcccccggacaa 

tcgcttcgcggaaaacaccccagctgccaccagttaacggtcactgcgccccgggaacctgacatcactctagttcaggt 

gctggggagtcttccaggcccggcccccaacagcggagcccccccaccccagccctgccatcacggccgctggggtccca 

ggcactgactgctcaggacagggccgggaccggggagggggacctcggcgaacgggaaggaaccgggaggcagggagcag 

a9g99tggcctcaccttgcggggtgcaccccggggccggggagggcagggacgaccactgcagcggcggcggctgcagga 

gctcaacgccgagcacgaggaagggagccccgcgccgcggccgccctcccgtcggcacgcccccgcctccgcccattggt 

tgatctgggagggtggggcgagggacgctccggaccaatgagcgggctccaaagaacggccaactggcgagggccgccta 

cgtcacgtgccagggtcgccgaggcagcgccctgctagtccgcgcctgccgggcgagctctcgcgaggaagacgggcagg 

cggcccaactaggccaggggccagaaccgaccactcgaagagggagaaggagggcctcggataggccccgcccccgctcc 

ttcttccgcctgggggatagcgcctctagccttgaaccttgcttaggacgcacctcccttgggcccttcgctctcgggag 

ggctgtcgggcgcgtctcggggctgggtggagctcccgaaggtggcctttctccctgggcttccacgccggcttcggcca 

tcgatacgggccgtgttggtctcgttcaggagctgaggaaccctccatcactcctgtttcgaccccagggtttggacctc 

ttccccttccaccccatcccctgtcttgaaagaagcaacccccgtgcgggcccgagacgcgtcccgggtgcctggccggg 

cctggagaagcatcagaacaaagaaggcacgcgggctgggggctgggagagcctgtgacgcgcccccggggaccgcagcc 

tctgctcccggtctccatggaggcggtcgccatggcagcgagatgcgcctcgctcagcaccgcggggtgggatgtgggcg 

cctgcaatgagccgaggagcgagaggcgtggccctccggtctgcgggggttcttgccggtgctctccgcccgccggttcg 

cgaacaccccacctatacttcgcccgtggggacggattccccaaagtgccctcagtaagccgtccggagcacgcagcgcc 

ttgcttccaacggaactagagagacggcctgggcggccgaaggccagcctcccttcagcagggccggggtcgctgcctta 

aaggagccctcaagtctgccaccctgtggcccataacctgtctgctgatctccagtctgcacactgttggcaaattaatc 

tttctgagctcttgttttcatcgcgtccctctcctgctccaaagccctctgggactgcctccagtagcgGttcacaaact 

tcagcagcactttgggtgactcatgtgccctcgcgtttgagagacagcgcttacattgagagcttttcacattctgatct 

cagcccatccactcctgcccactctacccactcccaccacactacctttagaccttgtcttgaaagtctaaagttggcag 

acggtgggcccaaagtggcccgtaggtgtattttgtttagctcacatggggctttttaaaagaagtccttccccctcctg 

cctcctccgcctctttctggaatcaggaatctggtttcgtggcttttgaaatatcagaagatctgacaacgtgggctcac 
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tttccagcctggcattacaagcccctttaagagcacataaatttaccatagtccccgctaatccctgtttctcacggtta 

gttaccttctgggtctgtgcagacatttgtgatcccctccccctcattatcccaccttccaacctggaaggccctccccg 

ctctcttctcatagccaaaattcaagccctcttcaaccaagataagttttgcctatttatttatttatttatttttgaga 

cagagtctcactctgtcgcccaggcttgagttcagtggtgccatctcggctgactacagcGtccacccGGtcccccgccg 

ggttcaagccattctcctgcctcagcctcccaagtagctgggattctaggtgcgcgccaccacacccagggctaattttt 

gtatttttagtggagacagggtttcaccatgttggtcaggtgggtctcaaactcctagcctcaagtgatctgcctgcctc 

agcctcccaaagtgctgggattacaggcatgagccaccgcgcctggcccaagcctgcctttttcaaaagtctttctagcc 

aacttctcaaaaccatctctggtgtgggctctctcaaacacaaactggctgctaactgcacagcccggggttatttccga 

attgtcaactcctttcagcaaacatttactgaatgtatctgatgagtaagttattgtgctaggctctgtggagtggaagg 

cactcacagtttagtgtaggagggacaccacacacaaagaactgtggtctcctgtccttgccaaatgttcctcacactcc 

caggcatcgggcagtctggcccactgggaggagctgaaatacagagctgcccatgcctagggatactacatgctctcagg 

gaaggccaaagatggttctggaaagcctccaggagaaaattagactctttgtgtcaggaggaagtcagggcaggagttac 

ctggctgccccatcactgcccaatgtgtctgtgttaccacacgagtagtgcccaggctctggggcttcctgtgcggaggg 

ccttaccaaagatagatggctctgcaagggaaaccacaggcccctaccaggccccctggagacagaggcaggacaagaag 

aatcaggaacaactccaggttaggagaacagatggagcacgtataaaacatccactcacggccgggcgtggtggctcacg 

cctgtaatcccagcactttgggaggctgagatgggcggatcatgaggtcaggagatcaagaccatcctggctaacacggt 

gaaaccccgtctctactaaaaatacaaaaaattagccgggcatggtggctgatgcctgtaatcccaactactccggaagc 

tgaggcaggagaatggcgtgaaccccgggaggcggagcttgcagtgagcccgagattgcgccactgcacttcagctttgg 

gcgactgagccagactccatctccaaaaaaaaaaaaaaaaaaatccactcaccatgagaacctggtgacttttaaaaaaa 

aacaaaaaaacacaacacttgctacaactccaaacacacggttttttgtttttttttttatttttcagagacagggtctc 

actatgttgcccaggctggtctcaaactcttggcctcaagcgatcccccgaccttgggctcctaatgtgctgggaccaga 

ccagaggcacaagccactgtgcccagccctctattctttttaattaaaaattaaacaactataacctcattggaaaactg 

cttggcagtacctaataaaaatgataaatgaaatgagtaattccactttttttttttttttgagacagagtcttactctg 

tcacccaggctggaatgcagtggtgcgatctcagctctctacaacctctgcctcctgggctcaagcactcctcccacctc 

agcctccccagtagccaggactacaggcacatgccaccacacccagctaatttttgtattttttttttttttttgagatt 

acaggcatgtgccaccacgccggctaattttgtatttttagtagagacgggtttctccatgttggtcacgctagtctcga 

actcccaacctcaggtaaactgcctgcctcagcctcccaaagtgctgggattacaggcatgagccaccgtgcctggccaa 

tttttttgtatttttaatagagacggggtttcaccatgttggccaggctggtctcgaactcctgacctcaggtgaccgac 

ccacctcagactcccaaagtgctgggattacaagcttgagccaccatgcccagccatggcatgtgtttaattttaaaaga 

aactggcaaattgttttccaaaatgactgtacttacttacactcccattagcagagtataagagtttctgttgcttcaca 

tcctcatcaacataaaaatttgcaattttgtaaaaaacgctacaccatgctaaccaaacaaagcagggcaacggactacc 

agtcttcagcctctcctctgagccatgcccttgtagaggacaatttcacaacagctatcaagattacaaatgcacacaca 

aaaaaagagaaaacacacacacacacacacacacacacacacacagccaacatggtgaaaccccatctctactaaaaata 

caaaaaattagctgggcgtggtggcaggtgcctgtaatgccagctacccaggaggttgaggcaggagaatcgcttgaacc 

cgggaggcagaggatgcagtgaggcaagatagtgtcattgcactccagcctgggccacagcgcgagactctgactcaaaa 

aagaaaaaagagagaggaaaaaaaacacaacaaagattacagatgcatatattctttgatctaggaaccccagttctgga 

atttatcatttagatatacttccccatgatactaaatcatgagtgtatgaggttatttattgttgcattcttgtgatagc 

aaaagattggaaataactcaaatgtccataaatagaagacttcttaaataaatttggtacagacaatataatgttatatg 

gctttaaaaagaaaaaaaaaaaagaggaagttttcaatgtcctcccaagtagccaggattacaggcgtgagccaccatgc 

cgggataatttttgtatttttagtagagatggggttttgccatgttatacaagctggtctcgaactcctggcctcaagcg 

atccacccacctcagcctcccaaaatgctaggattacaggcatgagctaccgtgcccagcctgctaaggaaagattttgt 

attagctaggattctccagagaaacagaactaggaaagagggcagataggaaagactgacatattataaggaattgactc 

acacaattatggagtctgaccagtcccaagagctgcaggttgggttggcaagttggagacccaggagagccaatggttta 

gttccagtctgagtcccaatgcctgagagccaggaaagtcaatgatgtatctccagtccaaagggcagcaggtttgagac 

caaggaagaatcaatgtttcagtttgatcccaaaggcgagaaaaaagctgatgttccagttcaaagaccaccaggcaata 

aaaattctctcttatttggggagggccagcctttttcttctatttaggtctcaactgactggatgtggcccactcacatt 

agagagggaaatctgtttcactcatctgctgatttctttcttttttccttttgtttttgaaatggattctcactctgttg 

cccaggctggagtgccgtggcatgatcttggttccctgcaacctccacctcccgggttcaagcgattctcctgcctcagc 

ttcctgagtagctggggttacaggcaatcaccaccacgcccggctaatttttgtatttttagtagagacggggtttcgcc 

atgttgaccaggctggtcttgaatgcctgacctcaggtgattagcccgcctcggcctcccaaagtgctgggattacaggc 

gtgagccaccgtgcccggcctcatctgctgatttaaatgttaaccttagccgggtgcggtggtttacccctgttttccca 

gcactttgggaggccgaggcgggcggctcactcgagctcaggtgttcgggtcaacatggtgaaacctgcctctactaaat 

acataaaaattagctgggcatggtggcatgagcctgtagtcccagctactcaggaggccgaggcacgagaatcacttgaa 
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cccaggaggtggagggtgcagttaactgaaattgtgccactacactccagccagggtgacaaaacaagactcttctctca 

aaaataagagtaagacaaatgtgcatgtttgtttatagtGacataaagtcattccggaaggatacataaaaactaaagat 

agaggttccctattagagatgtgctgggaataagaagacttttcactttttatgtgttttttttaaatcatatatatgtt 

ttatctattcaaaacattaagtaacaaagtttaaatacccttgcatgccttcctctcctgtagggttgtgaacactggag 

ggaacaaatcatgttttatactcacttttgtattcctggtatttagtacatagtaggtactcaataaatagaggaaagaa 

cgaatgaatgaaccagaagatagcagatttagtgtcggttagagagaggagaggggccaggcacagtggctcacacatgt 

aatcccagcattttgggaggctgaggcaagaggctcacttgaacctaggagttggagaccagcctgggtaacaaagcgaa 

atcctgtctctccaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaagcccaggagcagtggctcacacctgtaatcccagca 

ctttgggaggccgaggtggggtggatcacctgaggttagcagttcgagaccaactaggccaacatggtgaaacaccatct 

ctactgaagatacaaaaattagttgggcgtagtggcaggtgcctgtaattccagcaactcaggaggctgaggcaggagaa 

tagcttgaacctgggaggcagaggttgcagtgagccgagatcatgccgctgcactccagcctggatgacagagtgagaca 

tctcataaaaaaaaaaaattagctggatatggtggtatgcacctgtcctcccacctattctagaggctgaggtgtaaaga 

ttgcttgaacttgggaggctggggttgcagtgagccaagattgtgccactgcactccagcctgggcgacagagtaaaacc 

ctgtctctaaaaaagaagggagttagggctgggtgcggtggctcacggctgtaatcccagcactttaggaggccgaggct 

ggtggatcacctgaggtcaggagtttgagaccagcctgaccaatatggtggaaccccgtctctactaaaaatacaaaaag 

tagccgggtgtggtggcgtgtgcctgtaatcccagctacttgggaggctgagacagaggaatcacttgaaccagggaagt 

ggagagtgcagtgagccgagatcataccactgcactccagcctgggtgacagagcaatactctgtctcaaaacaaaaaaa 

aagaaagaaagaaaaattttttaaaaattaaattaaaaataagggagttataatcttatctaatgtaatcacaaaagttg 

acatcctatcacatttgctgtattctacttatcagaagcaagtcactaagtccagcctacattcaaggaaagggtacaca 

tgggggtgacgtccaggaggcagggattactgggagccacgttaggagctgtctactacaaggaggaaaaagcaatgaaa 

ggatgaaatctaggatggtggttatctctgagtggaagcaagggggagagatgcaagaagcacatggccggtgaaagtta 

tttgtaataattccattcttggggcaggtggtggatttataggtgttcataagtgagttgattatggacaaatggggaga 

gagtgtcatgaaccaaggattatatttagtccaattcagtaatatttaaaatattcaataataaatgaaataaaaattgc 

caactactcctgaccttgttctgagccccatccatgcactccacctggtcttagtttaacactggctccacctgaccctc 

cacttgccccaaactaaccgttagcccttcacacacactgacccttgctatagcaccacgatctctccatagtctgcaat 

aggcccccggagttctattccctgaaacttttatccatgtaacctcacctgtaggctgctatatgtaaagatagtaggaa 

tggctgatggagtggcccgaggaccaggagacagggcaggGtaacagttctgccttagcagtagcactgtgggcccagaa 

gacaggtttgatggctggtatcacgacctcacctattggcctcctacatggcaaaggctgggccaggtcctgatggttct 

tcctctttgaggcttctgctcctggtatccagatatagctaagtctgtctctgagaagtgcacctctgccacagcaccag 

gcctgtaactcccttgttggtaatagactctatcaactctctcctttcctgtctttgatgaagtaagctaccatactgga 

gatggctatgtgtcaaggaaatgagagtggctattggccaacaatgagtaaaacactgaagctttcagtttgacagccca 

ctaggaagcaaattcagtcagcaaccaagtaagcctggaagcctggatccttcctcagttgtgccttcagatgagactgc 

agttctattggccctttgcttgcagccttgtaggaaacattgaagtggaggacccagccaagcctggcctggactcttga 

cccacagaaaatgtgagatataataataagtatttgttgttctaagctgctacattggtgataatttgctacacagtaag 

agataatgtatacaggttcttttcatttcttttgctattacaaagagtgtttctgtgggccaggcatggtggctcacgcc 

tgtaatcccagcacttttggaggccaaggcaggtggatcacttgaggtcaggagttcgaggccagcctagccaacatggt 

gaaacctgtctctaccaaaaatataaaaaattagccagatgtggtggcatgcacctgtaatctcacatacttgagaggct 

caggcaggagaatcgcttgaactcaggaggtggaggttgcagtgagccgagatcgtgcaattgcactccagcctggacga 

cagagcaagactccattttaaaaaaaacaaaaccaaacaaaaagagtgttgctatgaacgttgttgtaacatgtctccca 

gaacacagaacacaagagcctaggtttcccttgggtatgtgctgggtatatgcctaggaatggaattgttagatctcagg 

atgtttgttttgttttgagacagagtgtccttctgttgcctaggctggagtgcagtggtgcgatcttggctcactgcaac 

ctctgccccccagcttcaagcaattctcctgcctcagcttcccaagtagctgggactacaggtgcccgccaccacgcctg 

gctaattgttcgtatttttaatagagatgggttttcaccattggccaggctggtcttgaactcctggcctcaagtgaccc 

acctgccttggcctcccaaagtggtgggattacaggcctgagccaccatgcccagctggatctcaggatttacaaatgtt 

ccacattactggtagtcagaaaaccattaaagttagggtcatgccatggctgtagtgtgttggccagggttgggagtaaa 

gcctggcctcaatccatggaagttgggattgtctgtgggcagggttggagtcgtacagagggccgagttaggggtcagtc 

cttacgtagggttgagggtgagtgtgtggccacatttgggaacaatttgaagtctggtttggggttgggtttggacaagg 

ttggagaacactgtgagggtgggtttgggggtcaatctaaggccagggttagaagttaggctgtcatcctgtgcaaaacc 

tgacccctggtgatatcatcaacctaccaagctgtggccgcacaggacccagccacccacaagatgagatccactctggg 

tccagaaagctctttcactcaaggcgggggtggggagatggaggacaagtgaagaagaaactggctcccctcagatgcaa 

aaagagaaggcaggcacaggaaatagaacacaacactgactttaatggggcagccctgagccgtaaccttcaaaccttcc 

gcctcagtaccccgtgcgcgaggagggaggggcgactgctacgggcacatcgtcgatgtcctccctactcctggcgatcc 

cacgcctcaccccttcctctccagctgctgcggtctccgtcgccgaggtgggtcccaggtaagctcccaggaggggcagg 
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ggagcccctcagcggcccgggacagattcccccatggctgagggcatggggaactagcctggatggagacgccgccgtcc 

tcggagctgggcggggacctgattctgggggtgtggacagagccaagggaccgccccccaaggcccagcgccgggagatg 

caaggccgggcccaaggtgttggacactgctttgggggacaggctaggtctctgcacgtggcttctgggctctggaaagc 

ggtccattctcctgacccggatctccggagtggtaggaggcggctcagtcccgggcctgcgcticctagagttcctgtccc 

atctcctcccacgctcacccatcccaaggaaggagggcactcgggccccagcaggctcgtgagagcagcgggctccgccc 

tcccaatggtctatccatcggtgggtgggtccggcgcggcgtcggggctctggcgggtacccgggcgtccccgcgcggcc 

cgggccgccgctcaccgctgcttacgctccgcctgctggagccgccgggagaggtcttcgatccgcacattcttgagctg 

gagcaggtgctccaggcgccttaccttcatctgcaggacctaagccgcaccgcgcaggcgtcaagcctggcggtctgctc 

cctcctgcccggcctctgctggccgcgagcccccacccgcagccttcatagacccgtcacttgcacggtctcttgagagg 

ccaacagctcctgctccttttcagcgatctggaggacgaagctggggtcgccctgcaacgcccggttatacgctggaggc 

cgaggcgccggcggccggtcccagctggggtgggaagcagcttagcggaggcttggacctcgggccctacactcaccctt 

ccagggccctagaccaggccttcccttctccccctctagcccctttcctgcctcccgtctcaggctcagcgtaccgtgcc 

ccccaaccccagctgtgtgcagagatcaccagcctggtgaagtctggaactcccagcttctttacctgagctgaccccct 

cccggggggtccgggacaccttcacctcgggccttctgggatacacctgagacaaggcaagtggaggaatgactgtgctg 

gagtctcattctcctgcttccccctccctacttccactctcacaagtgacaggaagaaggacagaaggaaagctccaacc 

gggagggagaaatggaaaaggccaccttacccacatccatgtagccactgccatcctggggagccagctcctgaaagaac 

cccgggggtccgctgtaagcccgggctcagtcatccccaaggagccccatgtcccttccacgggcccccaggccaggctc 

tattagccaagcacccctcccccacacttatcacccaggctggcctttctgtggtctggggagcgccccccacccagccc 

ccagagcagtgggagaagctggagcggggtggcacctgtaaggagccggcgccctgcttcctgcgcctctgcctctcctc 

caggcgctgcctcagcgggatgagcaccagctccaccacgcctggggcgcactgcgcgatcttgcgcatcacgtcatccg 

gtactgaaaagttcagcctcttcagtaccttcctgaggggtagacattgggatagcagattgggtcctggggaaatgaag 

cgaggggatgggggagagggaagggggcatggtggttgctgttctgagtgtacagccctgggggagtgaaagcgcatcgc 

atctgagcttccttgtgtgggtgttccgaggttccttgcctccttcctaatagaagagtctcttgattttgtaatcggac 

gaattctgtctgcgttgaccctccccaccctgccaccaaaacagtctcagcctgggcaaccaagaccataggagatgatg 

gggctgcacaggtattctgctacctgttcagatgaccccagttgctgagcttctgctggagagagttggcggggacataa 

ttgtgcatctccaccatcttggggaagtaaaacttgatgacctctgcaacaaggactgagggagaggggagcagacatgg 

aaagtgggggcatgggtaaggaaggaggaggtgagagaaggtgggggtgggagaatcaggcatggcagagtggtggaacc 

caatggaggggtggagatgcggaaaggggaaggataaggagggagacaaagacattaagattagtgataggtagaggaga 

ctaagtctccagcatacctcagaggagctgcatcccttccccactaatttaggaatgggaatactgagtctctggtcctc 

cagtagagaatgagtcagccagtgcaggggctcacacctgtaatcccaacactttgggaggctgaggcgggtagatcacc 

tgaggccaggagttcgagaccagcctggccaacatggtgaaaccccatctctactaaaaatacaaaagttagccaggtgt 

ggtggtgcatgtaaacccagctacttgggtggctgaggcagaagaatcgcttgaactcaggaggtggaggttgcagtgag 

ctgagatcatgccactgcactccagcctgggcgacaagagtgagactctgtctcaaaaaaaaaaaaaaaaaaaaaagaga 

atgagtctccaagagaagctttcaggagcccagggaattcacatggatgcatgatgcacacacacacacacacacacaca 

cacacacacactccaacaccaaagttccaggagagtaaaagactcacgtaagcagagacacatgtgataagatcccccta 

cacatagatacatagggcagaatgtgtagacacatagaacaaacacatgctagacacacataggcacacatgtagaacac 

acatagacagaagcacaaaaagccacatatgcacacagcctccccaaatatacagatacagagatgcatagaagtacttg 

acatgaaaagatgacagataagtatttctgtcattgagatcccaggagacattcctagaaacacaaacaccttcacaatc 

acaaagaaacacatgcacacaagacccccccaaatacaagactaaacatgtcaacacatatgaacacacacatgctcaga 

ctcccagaagacatagaacacatagcaatgagtgagcacatacagaaatgcataaatacatatgcagacacatatcttaa 

tcattaatgtccctggagggagttagaggcaccccccccaacatgcacacaggatagaacatgtagacatacatatatgg 

acacacatcgttcacacacagatgcaaatagaaaacaagcacacacatctaaaatttgaagtcccaggaggcagattcct 

ccccactcccacaagcacacaaacacaaagagtttgcacagagacccgttacacacgaacatacaggacagaacatgcag 

acacacaagggcacacacctccatcgctaaagtcccgggagaggtttcgcttgggccgggacagagggatgttgtctacc 

cacaggtacagctggtgcagcgcctcctcgtccacgctgctcgccattggcgtcctcacggcctggccgccccagcggtg 

ccgggtcccgccccagcctcacgacccttcaggcgcttcctttctcgccactgcagaagcccataactgcctctgcctgc 

ctaatccagagactcacgtcccagctgggagcggccatattgttttctgaaaccatggaaaccattcccgactctcctgg 

gtacaaagacttctgggagttgtagttttatcccctccccgctacctcctctcaggcccgagctttgaaaaggcggttag 

ctgcccttgtcttcttcccaagaaagcatcacttctgtgcccacgccacctagtgaccagccactcatccattttgggac 

cagccaaagccaaccaggaccccagaattccgcggatccttctatagtgtcacctaaatgtcgacggccaggc 
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~MAARPLPVSPARALLLALAGA*^LAPCEARGVSLWNEGRADEVVSASVRSGDLVII PVKSFDSKNHLjEVLN I R;aC RESKELI INJ2ER3JE.G3IASS FTET ; 97 

LWPVPGAGVLQGHIPGQPVTPHWVLDGQPWRTVSLEEPVSK^DMGLVAiSEAE|jQELLLE|lEKB]HRaLAPGYIET i 95 

NIGGTEEQQAESEKAPREPLEPQVLQDDLPIS LKKVLQTS Li^E PLR I Kfe;ELD ! V|Dy H X LErjLQgiREtgVPGR PTLV .- 98 

GLVGPVLGAARPGFQQTSHLSSYEIITPWRLTRERREAPRPYSKQVSYVIQAEiKEHIIH2ERfflKDfflL,PEDFVVY : 95 

■HLPAIAPSRPWALMEQYEVVLPRRLPGPRVRRALPSHLGLHOERVSYViGATyHNFTLHgRKERDlLGSGYTET : 8 6 



---MGWRPRRARGTPLLLLLLLf 
MRLALLWALGLLGAGS PLPSWPj 

— mgsgarfpsgtlrvrwl: 
mrglglwl 



00 * 120 

H£LQDgTDg?SLARKYTVILG| 
-SGNPQTTTRKLED 
" "?^VLAPKHT- 
IVSEGHT-- 

JTLITDHPN IQNl 

TAANSSEETEQPRG QDJ 
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|lTLSR& 
,TPE1_, 
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160 + 180 

KSAT NR Y KL FPAKKjiKS VR|4S F 

PDSKGQ mLE'Y§SEH|KPPPi;NB 

pprgskdfst|gesf|^meqbltwk^tb 
[gp gdlqgppipss|iqdS1hlpght| 

-,QNSS HFE""" 

DE — GGEGGRi 



FE$If|YiMDDVYKEPLKi 
GR|AVYQAEHgLQTAjgT| 



jjSHHNT-PNLA 
SFEHSK- PTTR 
|--HRD- PGNK 
VLSWRE- SVHT 
3VSNKDI EKET 
gVSDDSLGSLL 



200 ^ * 

A K N V- F P P PS Q T WAjg|j 
DWALQFTQQTKKRP*| 
AGMTSLPGGPQSRGHi 
QTPPEHPLGQRHIRgljj — HUVVTESm' 
AKDEEEEPPSMTQLLERR&WLPQ§R| 
GPRTAAVFRPRPGDSLPSf^ 



Catalytic Domain 
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::-'KKRPdQDATKHKLIHI 
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; S§}A§K- Y§D FQ H L LN^T LgJV 

:erydmmg|kqtavreemill| 

A^^aMLGSE-AAVRHgjVLSV 



260 
jp&BKF^P! 

fsJYc'HKFi' "' 

JaYgBQLLHtrl 

lLLLHTFFfep 

|2ylBsmS!im 




280 

DMjj^CS^fQDgtFTSl 
! IIGKMCE-/ Hi.'jY Ss 
ijERSRS R^T Q DANA; 
|QRiF;jLVElgPN|lAV| 
1NGNLINIVGGAGD' 
""SQ^RFH^PD^SVgl 
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BWAQRP 
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JSBQMAVEKJ 

!AD CgSASAADS 

-PDGHCVEAAAESL 
PGNSHPHPGPAPAK' 
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'TRMLYGGRrI 

PGLPVPPALJ 

FERW5LPFMAAI 

:pdeaysaps 

iLSHLVGGPVj 




Cysteine-rich Domain 
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600 * 620 

GgAS RfVI GTNAgS I ETNI PLQQSGRI Lj 
IS SEA Rgj-LES NASP I DTT I - IMNwRQIQ 

sllaphm^pvdstvhl-dSqevt 
.tqSllgsi rdllweti dv-nSteln| 



qSr|thi 

TBRgALJ 

.nHswvhi 



640 

hvylh Dr 

HVYRiPEEEG] 

-PSAQLgLLGLi 

hld: " 



580 
VSKSSFAK| 

[NGEHRI 
DSEGHFLPj 

irnpsgsyvs 
|-fsgneyki_ 

kasrySai 

EGF Do main 

680 



, M . LE— 

. QElgVFGIVPAIIQTP SRGTK^®VDFQl|§-' 

GgQQPLGRAICIgjDVCHALTTEDgTAYE 

Transmembrane Domain 





HKQE 



720 

BlRQADNQGgjTIf 
1— gMPPE SVG P WA| 
-gVQAENHDT FLLA 

QLKATSSlTTg 

E T KgY^^Vggg — gT Y NEMNT AgR D| 
''.KLLTEVHAAEBfSLHVL 



Cytoplasmic Domain 



» 7 60 

frig — CLLAAGFWYLKHKTLIRLLFTNffiKTTIE-KLRCVRPSR@P|GFQPC 
J — VLAVLMLMYYCCHQN — NKLGQLQPSALP-SKLRQQ FSCHF&VSQ — 
Hf.SVSLPLLPGAGLAWCCYHLPGAHLQRCSWGCRRD-PACSGPKDGgHSDHPLG 
[LSLlVLLVLVMLGAGYWYgARLHQRLCQLflGPTCQYRAAQSGPSERPGPPQRA 
"iFFFLIVPLIVCAIFIFIKRDQLWRSYFRgKRSQTYESDGKNQANgsgQPGSV 
frLAGIIVYRKARSRILSgjNVAPKTTMGRSNPLFHQAASRVPAKGGAPAPSRG 
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* 800 

QAHIiGHLGKGLMRKPPDSY 

NSGTGHANPTFKLQTPQGK 

GVHPMELGPTATGQPWPLD— 

LLARGTKSQGPAKPPPPRK 



* 820 * 840 * 860 * 880 * 

-ppkdnj§rrllqcqnvdis— — rp lnglnvboh qstqrvlpplhrapraps vparplpakp 

-rkvintpeilrk@sqBpB — rpppdylrggspHablJIahlsraarnspgpgsqiertessrrpppsrpippap 

•pe — nshepsshjge khlh avs p d p qdqvqmhrs clw — 

■ plpadhqgrcpsgdlhgb gagipplvvpsrgaspiptvsslyl- 



^KQ|QQFPSRiP 
'PAgPVTVSSlP 
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812 
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819 
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93*0 



* 900 * 920 * 940 * 960 * 

ALRQAQGTCKPN PPQKPLPADPLARTTRLTHALARTPGQWETGLRLAPLRPAPQYPHQVPRSTHTAYIK i 909 

NCIVSQDFSRPRPPQKALPANPVPGRRSLPRPGGASPLRPPGAGPQQSRPLAALAPKVSPREALKVKAGTRGLQGGRCRVEKTKQFMLLWWTELPEQ : 84 4 
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Signal sequence Pro-domain ^ 

MGW fePRRARG TPLLLLLKLL LLWPVPGAGV LQp HIPGQPV TPHWVLiDGQP 5 0 



WRTVSLEEPV SKPDMGLVAL EAEGQELLLE LEKNHRLLAP GYIETHYGPD 100 

H 

Cysteine switch ^ 

GQPWLAPNH TDHCHYQGRV RGF PD SWWL CTCSGMSGLI TLSRNASYYL 150 



RPWPPRGSKD FS THE I FEME QLLTWKGTCG HRDPGNKAGM TSIiPGGPQSR 200 

Catalytic Domain ; ^ 

GRREARRTRK YLELYIVADH TLFLTRHRNL NHTKQRLLEV ANYVDQLLRT 250 



LDIQVALTGL EVWTERDRSR VTQDANATLW AFLQWRRGLW AQRPHDSAQL 300 



Icff rptejae 



LTGRAFQGAT VGDAPVEGMC RAESSGGVST DHSELPIGAA ATMA&EIGHS 350 



LGLSHDf PDGC CVEAAAESGG CVM&AATGHP FPRVFSACSR RQLRAFFRKG 400 



GGACL SNAPD PGLPVPPALC GNGFVEAGEE CDCGPGQECR DLCCFAHNCS 450 

Disintegrin Domain ^ 

LRPGAQCAHG DCCVRCLLKP AGALCRQAMG DCDLPEFCTG TSSHCPPDVY 500 

Cysteine-rich Domain 

LLDGSPCARG SGYCWDGACP TLEQQCQQLW GPGSHPAPEA CFQWNSAGD 550 

— — ► 

AHGNCGQDSE GHFLPCAGRD ALCGKLQCQG GKPSLLAPHM VPVDSTVHLD 600 



GQEVTCRGAL ALPSAQLDLL GLGLV EPGTQ CGPRMVCQSR RCRKNAFQEL. 650 

EGF-like Domain ^ 

QRCLTACHSH GVCNSNHNCH CAPGWAPPFC DKPGFGGSM0^SGPVQAENHD 7 00 

Transmembrane Domain Cytoplasmic^ 

iTFLLAMLLSV LLPLIiPGAGL AWCCYRjj PGA HLQRCSWGCR RDPACSGPKD 750 

Domain fr- 

GPH RIDHPLGG VHPM EjLGPTA TGQPWPLDPE M ^HEP^SHPE KPLPAVS| PDP 800 

Putative SH2 T S Putative casein kinase It Putative SH3 



Putativa casein kinase It 
phosphorylation site binding domain 
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ggagaatagtggggttgcagtaggaggtagaaggttgggaatgaattggaaaaggagtga 

ctgtgtagccaaggctgtgccttgctcattctaggccaatttcttccttcttgtctctcc 

ctccctccctccctccctccctccctccctccctccctccctccctccctccctacctcc 

ctcctagctatccttccttccttccctttacttctatcttccttctttctttccttccct 

tcccttctcttccccattccccctagaccaggtcccatgtagcccaaactgacttcaatc 

tcattacagatgaccttgaacttcagatcctcagtccccatttctttttgtgcctgaata 

tcgtattttctctgggagcctgtccagtattttcctccaataaatttcattccaggatac 

aactggagtcaattttcttcttgcaaccaaaatgtctaggctagaatgtggagaattcaa 

atattaaacgctagtgtctcagacaccatctgtgggcatgtggccagggagcataaatcg 

cacagtgcttgcttcatactgtctgctgaagtcatccgggccccttcctgctcctcatag 

ctcctcacatgataactcacggggtgggctgctcgctcacagggactggcacaggcttgg 

cacatggcaggtggcacatggttcttgtacatggttgaatagaagcacagaaacccagct 

ctaccagaggcaagacagggagagaacaaaccaaagatggcaagatgttccaggatgacc 

gagagcaaaagccacacctaaagggtggtggctgctgagggtttggggacatgtgacttg 

aatctttgctctcaagtacttccaagagggatgttggagactacttggggtggggctcat 

tgggcccaatcttcagtagtctaaagcccctcagaaataattacttagctgtaattcaag 

cacaatatatacacatggtaagtgtattagcaaaatggctttctatgtaatataaataag 

ataaaaaaacaaacaaacaaaaaacaaaaccagaaataccagcacttcagagtaccccac 

tcccaacccctgcccagtgtagctgctaacttcactaatcaagttaatttaaaaatcaaa 

tataataccacagattttatttttcctgtgtttctgctttatacaaatggtatcatattt 

taaacatgagaacattgtgagatgactagccagtgtctgtctgtctgtctctctcagata 

tttatttatttatttcatgtatgtaggtacactgtcactgtcttcagacacaccagaaga 

ggacattggatcccattacagatggttgtgagccaccatgtggttgctgagaattgaact 

caggacctctggaagagtagtcagtgctcttaaccactgggccatctttccagtcctgac 

cagtgtctcttaatggtgaaatattcttcccttggccctagcattttatccttgggcatt 

ttctcaggaaaaatgaaggcgcgtgtcaccctgagatttgttactaggatgttcataaca 

gtgctacccatcagccaggaactgaaaacaataaatgctcggcagcaggggaatgaccga 

gtagacagagctacgcccacacagctatgagagctgtgcctacagctgagtgtggttctc 

agacggtcagtgggaggggctggcagcgatggctgcctgtggcagttacatgtcctaccg 

ctctattaaaggagctaacaaaggca'accttaaggaaggaaacatttattttgggtcacg 

gttagtaggcacagtctcagagtggttggaaagatggcagcaagtgtctctagctgaggg 

ggcaggagtgggaggccctggtcactttgtgtcggcgaggaagcagacagagaggtgggg 

gtgccgtttgcttcttactttcctcttcctacccagtccaggatgacagaacatcaggta 

ttgccacccacatttggagtgggtcttctggtctctgttaaaactctctggaaacaccct 

cactgacctgatgtgcccggaacaaggtgattctcagcccaagcaagtcacagcgaagac 

caaggacaggtatagttaaggaatgggatagaagtcaatacacagccactggtgggcaaa 

agggatgggcgggagcacaatggagattcgagagccagggaagtgagcaggttcactttg 

tgaaaatctcacatttctgaatctatattttggaaagaaaacttaatttaaaaaaatctg 

ccacttttcctacgccttgtgctgtgtatcgacatgggaggataccttgaggccaggaga 

ctgagatcagcctgagcaacacaataaaacctcatttcttacaaaacagaacaaattggg 

tgtggaggtgcacgcctttaatcccagcacccagggagcagagccaatcacccttgtaac 

catctcaaattcagacccaggggctgtctagcggcagaagagtccatggaacactgtgac 

catagtcatacaattggaaactacttatcgacaagaataaatggccataggtaataggca 

ataggttgggtaaaactcacaaattgggaaaatacaacccatactccagacaagggccaa 

tcgcctttaaacatagaggtcctttaaactgttaaaagaaggaccaaccactcactagaa 

aaatgagctacacatgaatatgttactatatttaaaaaaaaaaaaaacctgtgtctaaaa 
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cagcagcttatgtaagaaattggtagtagtggctttctgtggggaaggagtacccaaagg 

atggagactaaaggaagactctcctaagcattttataccttctcccttgtttgctttgtt 

gtagtggtggtggttgtttttgattcttttttgtttggttggttttggtttttttcaaga 

cagtgtttctttgtggagccctggttgtcctggaactcactcaggctgcccttgaactca 

gagattctcctgcctctgggcttaaaggtgtacctcaccactgtcaggcttgggtttgtt 

tttgagacggggtcttgctgtgtagctccagctgcctctgaagtgctagaattacagaca 

tgaactacagatatctcccagtctcctctcattctccctccctgccccctttgtatacac 

attcacctgtgtgtgcatgtgtgactcactctgagttcttccagtgttactctccatcag 

attttttgagatagggtctcttactgaacctggacctcaaacatttgcctgtctctgtcc 

agtccaacacttggcttacaagtttgtgcaatggttctggctttctctgattgctggaga 

cttgaactcaagaaccaacgcaggtcctttacttactaagccttctccttgatgacaccc 

ctcaccccactgccacacaccaggtctccaggctgatctcacactttttccatataggac 

tttgaacttgtcttttgcctctctgctgggatacaggtgtgtgctgctacacccaattta 

tgggatgctggggatggaactcagggccccaatcttgtatcaactgaggtgcgctcgctc 

tctcgctctcattctctctctctctctctctctctctctctctctctctctctctctccc 

ccctccttcccttccctctctcttccatgagacttctttctatattgcacaggctagcta 

tcttgagtgtcctgtctcgatccaattgcctgacctatcatactggttttctgtttttgg 

ttttggttttttcgagactgggtttctctgtgtagccctggctgtcctggaactcacttt 

gtagaccaggctggcctcgaactcagaaatccgcctgcctctgcctcccgagtgctggga 

ttaaaggcgtgcaccaccacgcctttaatcccagcatttgatgttcgaatcgtataaatg 

ttttttcttctgccgttggaacttttacattccatattctcagaacgaaattattaatca 

tcttatttcaaagttaaatctcttgctatccgaaatcataccaagtgtacgaagtgatct 

tttcaacacgaaagcaatgtccttaaaatttgggaaagggtccctcatgcctgggggtct 

gtggtatagaacaattctgtcctggccatcgctgttcctctgccctgtgcccaccactgc 

attgcctttaccaggaaccacagacttcagcaacgcggccaccacaagacccctaggtac 

ccagatgcatctcagctgcagtgggacagcttcactgctgtgggacactaggtgcaggta 

gacagatcctgggtagaaaggagaatccagatgtcactagggagactgcgaccagagatg 

tgtgctaggcagacagagcagctgtggggatcacttagctagtgcctctattcagcacca 

gagaggttctctaagaaccttcccggactttccagcccatcagccatcccttggccagcg 

tggataggattgttgcagataacgccagggagggtcatcctggctcccgcagcgcgacat 

cctgttttgcgggccagggtagggccacactcctgcgctcctgtgaggtggcgcgaggcc 

cgctgagggcgggaggaggcggccccctctagccggggtggagccaacagcacagcaaga 

gtgagagggaaaggcactcccagccatgggctcgaggtgcgggagacccggggggtctcc 

ggtgctgctattgctgccgctgttgctgccctcgtgtccgctgcggagcgctcggatgtt 

tccaggtgaggatacaggcggcgggggggggggggggggcggctctagggcagggtgaag 

tggtggtggtcatgggttccagtgcaactacagaacatagagttcccgaccctcggctac 

ctggccaggacaggatgtgggaagggtctgtgctcctggtaagggtggccctggagcaga 

cttgcccacacttacttagcaaagcaaataggcaacactaccctttcctcagtgctgccc 

agctctgtccggaagcttctgcacttccaaactgccaggccagagtgtctccctgctcct 

ggaaggacctggggtccttggagtcctggtctgcccacaggcctagctgtctcattcggc 

tcgtcctcttctcatgctgggtctgggcagtgtgtatggtgtttatacagttgccttatg 

tggcagttcttcaaacaggggtgtctgcctagccgtgcctggggctaccactttctcttc 

ctactgtaacctctccttactccgggacgacagggctcttagcatctccaagcctgtagg 

gtgaatctccaaactccacatagcaagcactaggctggccagaaggcagaaggaccaccg 

gggaggaggcagtaagggctgagggctggatattgggcagtctggaaaccaccccatatc 

tgttgactgctgcttccgtaaggcttctgagggggctgactgacagctgtattgccttta 

gcctgcggatagttaatggtgaggagagagagcctctgtgtttcctaatgcagactggct 
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tcctcagtttccactttagtccaccggactcctagcttgcctccttccttgtcttgcagg 

gctgcacagagaacactcacacctgaatccttatcctgcacccttggtaatccgaggcca 

gcaagacttggttctccatactcagttaagccttctcttggcggagagatctttaacctc 

tcaggaaggcttctctgtccttccttctcctcctctcccttctctctcttcagcctccat 

tccaaagccacacttgcacatccccacttgcacatggctcactaattacgaagcacatgt 

gactttaaggacgaggttcctggagctgctgctatttcagattttgaaaatgtatatgag 

cgttttgcctgcacatgcatctgtgtaccgcggaggtaccaggaaaacctgggggccgga 

agagagaggcggatcccctggagctggggttacagatagttatgaggcaccatgtgggtg 

ctgagaactgaacccggtcctcaggaagaacagcgagtgctctcaatccccgatccacct 

ctccagctcccattccagaatcttgatgtctgggatctagggaggcggtggaacaaggag 

ctgggatgacccaggaagggctgagccaacagagctctccactcaggcaaggccagagct 

ggagtggctggggtttctatgtttttttcctttttttttctgtaacctgtgccaactgga 

agatgcagcggcagagggataagtgggcttgctgctatctctgcaggagcttggctcctg 

caggataggtcctgtgagtgtttctgggaccagataggagatggggaggtaggaacattc 

ctgaagaacattccaacaccgtctgagtaatgcttggctccgattctcttcctggagccc 

cttggaaacgcctggcccaggatcactgagcctctgaaccctagctccttcaggttctgg 

gccaggtccgtcagagtcccaacaccagcatcaaccctaatgggagtcaggagctgtggg 

acagacatttggtggctgtcttactcagatggcaaaggaggcctccaaacacctccctca 

ccataaaggcatctactacccagtctaaaaatacccaaggctgcctccttcttcccagcc 

cctctgcgctggggtctgggcttcttcccgggacctctgacctcaaaccagctctctaag 

ctctggccgcttcaaattccccaatggggatggttcctggtgcctcggtttctcttttgc 

cacctccctggttgcttctccatttaggggatgccccatgaataccaaatcctggaagtg 

ctacacatggagtccagctttcaggacccctccttccccgatggcttcctgctcctaact 

gcaattttaacactattctctgctttcaccccaggaaatgcccatggagagctagtcact 

ccccactggatcctggagggcagactctggctcaaggtcaccctggaggagccggtaagt 

atccgttatcctgtccttattaggggccctaggtctcccgctctctgtcctttcccccaa 

atgcagcgtgacagtttttctggggttctcagaaccaaatcactgtaccgttaagccact 

ccctgttgggtgggtgggacaaagagacatctagatggcgtgaggctgcacacatggaag 

tcattttgggcttcctccctcccacccccttctattcctggccctaaccctcttctccat 

gggggaacattcaggccatgcccactcaactaagcaaagatgagttcatagtttggggcc 

taaagcctttcaggagagctctgtgtgtgtgtgtgtgtgtgtgtgtctgtgtgtgtgtgt 

gtgtgtgtgtattctgttctgttttgttttgattgagacagggtcttgctatgtagctct 

ggggggcttcaagtcatgagccatcatgccagccacaggagatttttaaaagaaggacac 

caaaatgtcctctttggccgtgtttatggggaacatgtgactccgtggacatgctgctgt 

cctgtaacctcagcttccatggctggaagccacctgggacaaggaagctgtgggccattc 

tcctcctgtccccagatgtattctcactctggtttgcttctccagagccctgctcagcct 

cctcctaacttctgctcattgtgttcccccagaagagccatgtgtgccttcactgcttcg 

ctgcccatggggaggccggcactgatccgcctggcttacagagaggcaagcgagccccac 

agagcttgtaaagttgctcatgcttgaatagttcattgggcctagaaacctgaggctccg 

tgggacctaagggcttcagatgattggctgcctccccttctttcctataactactcaggc 

tggagagagttcaattctgtacctcctgacagagggcagccccactgtccaaacctagag 

tgttgcaacagtgaggatgcattcccagaaacctaaaccttccaagtcagggccttcact 

ggggtcccccagacttctaggacttctctcaaccaagcaaagatggcttgtggacatgca 

cgggttcccctgatggtggcctctctccttgcctggctcggtctgtgagagaatcccagg 

gggactctgctctgcgttaggaagcctgtgaggccaggccaggaagagctttggcagggt 

gtgtgtatttcacaaacagggttgtactgcaggatggggatggtgcacaaagggggaggg 

agaccctggagcagaagtaaaggcagcaagggccgcagagggaagggaccttccactggg 
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gctactgcctttctcccagagcagacattttcccataaagcaagaggcactccaaccata 
taagctcatgtttgggcccttgtggagcctgtggctggggaagtgagggccatcttctct 
acacctgctgcagaagggccctgaaagattcttggccagggtcccagcccagtgcatttt 
gggataaaaagggaaagccatcgtgggtggggaaaacatttaaaaaaaaaatacagcagc 
ctcccctggaatctcttgggctagttccagttctggcttctagccaggctaagtggacta 
gcctgagagagaccaagtcagtgagagaggagaggtggctagagggccaaggccagccct 
tctgacatctagctaagagagtcaacactttttagggagccaaagttgggtttcatgttt 
acttcatgagttcagttatgaggccagctcagaggaatttcagaggatcgggcagtttgt 
cagaactgaaggtggaggaaaaagtttgggttctctcaggaatgagggaaaggcagcaag 
atggagagatgtgcaaccaggagctgtggatgcagctgggtgatttggtgcttgcttggc 
atgtgcaaggccctgggtttgatctatagcacccacagcaaactagcaagcaagcaaaca 
agcatacaaacaaaacagaagcaaggttgcaggtagtacaagagatggccaaagctgtag 
cccacccaatgacccacatcaccctcagtgcccactgcctcctgcttttcctgccttttc 
atggggctcctgactatggccatagcatatgtccatagcagacataacaacatatgccag 
gcaacgtagcaactgctatgtccgtatatggtggttatgcatctttaagggtccgttgtc 
acaaacacaagcactgagaacatctctaagttacaactcccacacatatccactccctgc 
aatgcgagagccagctgttcctatgagctttctcactaggcagctctacatcctaccggt 
tcctgggcagccaggtggccttggggcctagtgtgtcactgcgttccttctcggtcagat 
cttgaagcctgactcggtgctggtggctttagaggctgaaggccaggatctcctgcttga 
actggagaagaagcagtgagtaccagcgggggggttgctgaagtccagacaaagaccctc 
tctggagaggatcagtgctttctgggagggggttgggggctgggtgggaagcagcagtgg 
gagtgacagggagagtggcggatgtacttggggctacagtggactgaacccaactgtagg 
aagtacatacctggtcatctcataccctggagcagctggatgggcggctccccgctggag 
aagagtgagcaccagaacagacattgactgataccttaattcaaagggaattcttaggca 
aagggaacttccacagatggcagaagaaaaagctgcgaaggtcaaatccaatggtgacag 
ttcatctggtctacctggaatgcctccatctgcgtggaagagaacagagacatctggtct 
acctggaatgcctccatctgcgtggaagggaacagagaggagagggctggggtggggggc 
ttcccagtggggccgatcgaagagggcatcttagtgctaaagatcttggatggacctaaa 
gagtcaaagaacgggaggtggtaaaggagaaacacagtgtcgaaggagtgcagagggtaa 
gcaagatggaggcgtggaactgcagactccaccaccaaccacaccactcgctcatgtctg 
ctttcttacggacattgcagaaggcactcattcattttattctctccatggaatcagaca 
gactgggaacaaactgcaaacaccacgccaaaccacagttcctgacagtgccacatccct 
tcctactaaacggtcttgtacatgtgtgcacacattgaacgttagcatgtatgagtgcag 
ccagcaaagcagcagagttgtaatgtgtgtaaggatggctggccgtacaacagcctgggt 
ttttgatggaggacaatgtgaatttggagtcaggattttctgtaaggaagagcaattgaa 
gtcaggcatggaggtagaggcctgtaagtccggctcctctgggcaggagaacacagagtc 
aagtcagggactgcctgagttataaagaatgagttcaagaccagcctaggcaacttagca 
aaatccagtctcaaaatataaaggagtgaggaagagggaaagagaaaaagagggagaagg 
agaaggaggaaaagaaaagaaagaaggaaagaaagaaagagaaggaaggaagaaaggaag 
gaaggaagataaagaaagagagagagacagacagacagaaagaaagacagaagacagaaa 
gaaagacagaaagaaagaaagaaaggcagaaaagcagctatttatttatggttgcaatgt 
tcctggtggggacagggaagaagcctgaaaatggaaacattactgatagagtctagctcc 
aggtaaagggacccttccccgggtggctgaacagaagcagctacaggcaggtggaagtcg 
•gaagtacgcatcactgagggacagatggagacatgagatctagcagtctgaggagaactg 
■ gagggcctggggtccgctcagcaaaggcaggagtaccctggccctacttggcctctctga 
ggtagtatctcctggcacacacacacacacacacacacacacacacacacacacacacac 
acctgcctgcagcaggctccacagcaccaaaggtccccagcctgtccttctgggtgcttc 
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ccacacttcccatccccttgactaacccacctcaggactcgctccagacctttgtcttct 

ttttgcctgaggttttctctttgaggcacagtgtctcctttctctctgacacactgaatg 

cctccgctagcactcagggaggagagctggaagcaagcggcaggatgcaggtctaaatcc 

agtgactcctaaccctccccctttccagcaagcttctggccccaggatacacagaaaccc 

actacaggccagatgggcatccggtagtgctgtcccccaaccacacggtgagacactttc 

atgggatctgagaacatggggagggtgcagcctcaatgcctaccctgtcctgttttgcat 

tcctaaggaggttgtagtctgccttcaggccccacttcctccttgtcccaccaacttgag 

ctcctactcagatgtgggtatgagcctcttcctctctaggatcattgccaatatcacggg 

cgtgtgaggggcttccgggaatcctgggtggttctcagcacctgctctgggatgaggtaa 

ggcgttgggggagagggcctggggctgcggcagaggtgagcgcttcctgttcacacatcc 

ttgctatccactgcactctagtggccttattgtgctcagcagcaaagtcagctattatct 

gcaacctcggactcctggggataccaaagacttcccaacccacgagatcttccggatgga 

gcagttgttcacctggagaggggtccagagagacaagaactcccaatacaaagcaggaat 

ggccagtcttcctcatgtcccccagagccgggtaaggggtacagactagagtgggactgg 

tccttcaaagcagaggacacaggaatgcgggtcctggacggggtctccctgttttaattc 

agaggctgaagaatcaagtttgagcagaagtttaactatcccctcccctagcccgggggg 

atcacggtgtgtggccaaggattcggtcaagcagcctgcccctcaccatttgctcctccc 

aggtgaggcgagaggcgcgcaggagtcccaggtacctggaactgta"catagtggctgacc 

acaccctggtgaggagagacctaagaggtctgctgggtggggggactgctcagcaccgtt 

atgaccagggacctctctgccttctcagttcttgcttcagcatcagaacttgaaccacac 

gagacagcgcctcctggaggttgccaattgcgtggaccaggttggatgattagggcgctg 

aaggttgggagtgaggcagttgaaaagctctggtgagtgcctggttcctggcttcagcag 

gtctctccctcagattctcaggactctggatatacagttggtgttgaccgggctggaagt 

gtggaccgagcaggatctcagtcgcatcactcaggacgcaaacgaaacgctctgggcttt 

cctacagtggcgccgcggggtgtgggccaggagaccacacgactccacacaactgctcac 

gtgggtgccatttacacacacacacacacacacacacacacacacacacacacacacaca 

caccccggttgcgggatgtccttatttccaactccacccagtcacgccctgctccatcat 

ccttaggggccgcaccttccagggtaccacggtgggcctggcacctgtggagggcatatg 

ccgcgcggagagctccggaggtgtgagcacagtaagctctgctggggacctaggaggaga 

gaggtttgggtagggtttgtggtcctcagtgaaacccggctcctcaggaccactcggaac 

tccccatcggcacagcagccaccatggcccacgagataggccacagcctgggcctccacc 

atgatcccgagggctgctgcgtgcaggccgatgcagagcaaggaggctgcgtcatggagg 

cagccacagggtacgcaaagaaggtcgttgttaccggacctagacaaagcgactgcgtat 

ctttggttacccctcagttccctctttagtaaagataacagcctgctaatagggcctcgg 

tgggaaagatgctgataaggtggagtctcagaatgactgcgggactagggtactccactc 

tgatGcttggtcctccttggggcccggactgcccgattttagcgtcacagcctcctgtct 

ccagttcctaggctcctctctctctcgattgctcgcttagcttagaaagaaacagcacat 

cttgagggctgtgtggctaacctcagggcccagagtgcagaagtagcctgagcttaggag 

gtggctccagcacccgagatggtccccacaccaccaatgacatccatcttccatctgcag 

gcaccctttcccgcgcgtcttcagcgcctgcagccgccgccagcj^gcgcaccttcttccg 

caaagggggcggtccttgcctctccaacacctcggcgccggggctcctggtgctgcccag 

ccgctgcggaaacggcttcttggaagcaggagaagagtgcgactgcggttctggccaggt 

cagatcgtcatcttcgtccctggggttcaaggctaacccatgctcccagcctttcccagg 

tctagcttgctccaccatcacgtgttctgttctctcctttctgacttcagaagtgcccag 

acccctgctgctttgcccacaattgctccctgcgtgcgggggctcaatgtgcccacggtg 

attgctgtgcgaggtgcctggtgaggatactggggagcgccctgtacatcttagctgggc 

tgggacttctagtcccttttctgagtgtgagtttgacccagtgtatgggttcacgtagca 
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ctccctcgggctttcagcaaagatctctctgctttcccttaatgggtctctggtgtagtt 

aaagtccgcgggcacgccttgtcgtcctgctgcgactgactgcgatctccccgagttctg 

caccggcacctccccgtattgccccgcagatgtttacctactggatggctcaccctgcgc 

tgagggtcgcggctattgcctagacggctggtgtcccacgctggagcagcagtgccagca 

gctatgggggcctggt gaga ccgcacgctggtcctggtgccctgaccaat act aaaacct 

gcggttttctactgagggcaagctccacccgtggaactgaggcccgagctgcccgcttct 

tactccccctccccccagggtccaagccggccccagagccatgtttccagcagatgaact 

ccatggggaattcgcaagggaactgtggccaggaccacaagggtagcttcctgccttgtg 

ctcagaggtgaggtgtgatgctgagggtctgcagctgtaaagtagggcggagcatgcgga 

gggaacactccaagttgttgaccaccttccacttcctccccagggacgctctgtgtggga 

aactgctgtgccagggaggggagccgaacccactagtgccgcacatagtgactatggact 

ccacaattctcctagagggccgcgaagtggtttgccgaggggcctttgtgctcccagata 

gtcacctggaccagcttgacttgggtctggtagagccaggcaccggctgtggacctagaa 

tggtgagccctgcccacccaacccctcctggttattgagtcctccatgccaaagtgttct 

cctcactgcccagtgggcacaatgcccataggtgtgccaggacaggcactgtcagaatgc 

tacctcccaggagctggaacgttgcttgactgcctgccataacggtggggtgagtagcct 

aaggggtcagggtgaccttggaggtccttgctacctggtgacttttctatcctcatctta 

ggtttgcaatagcaatcgtaactgtcactgtgctgctggctgggctccacccttctgtga 

caagcctggcttgggtggtagcgtggatagtggccctgcacagtctgcaagtatcccaat 

ggggtgggggcaggcaggaaccacctgggcagtagcctgcttgagactcagcacccctgc 

cctccacagaccgagatgccttccccttggccatgctcctcagcttcctgctgcctctgc 

tcccttgggcttgcctagcctggtgctactaccagctcccaacattcttgtcatcgaagg 

gactgtgctgcaggagggaccccctatggaataggtaggttcggtgctcaggtctctctt 

cctgagcctgcccccatggctcctgcttctcagaactcttcagggctttgtagagtgaga 

ggctacagggagctggggctttaggaagctagatgggatccttattctctagatgtagtg 

agagctcccaggctgtgggaagaagtccgtggtgtgtatcactgccctgaccagcacttg 

gtggtgtggtcctttccatgggcttcccttgatttttttgttttatgttttttgttgttg 

ttggtttttttctgctttaaagaaattcaacacagcctgtgctccgctttgtcctgaaga 

cgactctagcttccttgtctcaatatcagccatctcccatggcctttgccctaattactc 

ccttcaggtccctcggtgctagccaaggacccctttgtgctccctataaactggccagct 

atagtgttgctctttctgtgccaggactctgtgcctctgtcccctgtagatcacagtgct 

gtaaacccaattttctggcaggcagaaatgccacctgatagcacacattatctaccgagg 

gcagcctcctcactaggcccctcgggcaagtcaggcacattatgtccctgtctgaacttg 

gaagtgttctagaccaattgagggaagggcagggttgggttggagatgtgactagagggc 

acctcaggccagaaacagcaccagcaggccccaggagccagtgagacggtctggggaaag 

ccaggtagtgcctggggggtgggggcggtgtctgcagacagggaaacaggtggagtgaca 

gttgggagggggcacttcagaggggtggcagctgcacaccgttatcgggatagggtgtca 

aggacagtgggaatatctggatgaaccatccaaagaaatggcaaggtctgtgagaaaggt 

cccggcagttcagagtctagactggcagagcacggctaaacggagcatgggaaggcacag 

ttcccaccaagggagacatctctgcacttcagctctaggtgggccctcggtgacgcctac 

atctagactgagtgggggctggagtggagtcacctcgggagaaaggatgtccacagccct 

ggaggccctgggacataagtgaggtctggacatcttaaggacagaacaggaatgtggaat 

tcgaagcttgagtggaatggagaagcaatccccctttgtctcatacacggtcactttcca 

accttctgaactcttatctggtctgtcactggcccctgcagagacatacccctgggcagt 

gtgcatccggtggagtttggctccatcatcactggagagccctcgccccctcgtaagtgt 

gccccttgggacatggagagggaagcaaggggtgggtgcttgccatctgcctccttctaa 

atgctctccttaacacaccttcaatcctctgccctgctcagccccatggacctcttgcca 
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acagcgttcgcaccctccatctcttgacttgctctcagaccctgcgaactctgagcttac 

ctaagaactaccctctgaagcagcctggtctacagattgagttccagacctgccctatcc 

ctatggtatggaagcaccctgaggacctcctgttgcccagtcacctacctctgtctcagt 

ttgttgtcccctcctcagatttacaggcttgcatcaataaagaaatgagacatgggcctc 

agagaagctgttgtcatagagaccatgatgctggaagccctaggggcagggaagggagac 

actgtggttcttcttgggtccttatagagggaggacaaatgtgccctgccatgtgacttg 

cagtcctcagtttctcagacgcactcttataattcctatgggctgtatgctgagctctta 

ctcagcataggaaccccagagcccgatcatgttgtatcccgcctgccctgagagctgtgc 

tattctgaaatgttagaatgtatctaataacaataaatccacaagttatatcagtgttgt 

tggctgtgacctgttaaaagggtctaagttgtttattaaaaagatatggagatggattac 

tgagaaagaaattagaaacaacatctggacagtggaggagccagcactggggaggaaagg 

gcagacagatctctgagttcaaggccagcctggtctacagattgagttccaggatagcta 

gggccacacagaggaaaccctgttttgaaaaccaaacagtcaaataataaaaacaaactt 

agtagcaccttgtacagacagagagaaaggttcccaggagagctcagaccccagcaccag 

aggtggcaaggcagagtctcaaagcccaggtggcaaaagtgggatctgttagcataaacc 

caaagggcgcgtgggacaggcagggagtaccctttgatgcaagctcactctggtgagggc 

ccctcacccctggatgtctgttagcaagggaatcagtcagtgtctcagtctgttaacatc 

tgtgagaggggaaaggctgctgcagacatggcctgagcagcatctggattcgaacatttg 

cactttagggcctgctctctcccctgggtggggcactcgccattcattgcctttacgaca 

gctgtgagagagaggttgctgcaagtgtgtatggctgggttagctcgagcccaccaggca 

atcatgatttccctcgacatctagtcattaacaaacacaggttcttttactacaatttta 

actaccaatattaactaataatgtgtataaaatatataacacagtacacatatataagta 

catagatgttagtacatataaatattatatatatatatataattgtaattaatattactt 

aattttattttatcatttgatattattttactgttagttataacaatgtgcataatatat 

gtgtaatataaaatataattttattatttaatattattatataaatttaattaatattaa 

ttatacctatatatttagtacatatacataggttacagaatggctacaaaagtgccagga 

gccatcaaggagaagctaaaagccagcaagtgatcttcctgagacggttctgccatggac 

tgtacaattagtgatggatttgcttctgtaggcaaggacgaggagatttcattttaggaa 

agattcctgctattaatatgcttttcctggtattattaaatatatataacaatcactagg 

tattagcccaccgttttgaacagaatgttctgcagaacaatgaagatgtactctcttgta 

atgatgctatatagacaaatagattatttcttttttaaaaaagaaaaaagagccgggcga 

tggtggcacatgcctttaatcccagcacttgggaggcagaggcaggcagatttctgagtt 

caaggccagcctggactacagagtgagttccaggacagccagggctactcagagaaactc 

tgtcttggaaaaaaaaaaagaggaagaaagaaaaaagatttatttatttattttatacat 

atgagtacaccatcagacacacaagaagagggcaccagaccccattacagatggttgtga 

gccaccatgtggttgctgggaattgaactcaggacctctggaagaacagttggtgctctt 

aacccctgagccatctctccagcccaaatagatgatttcttaattcttaaggatgatcct 

ataagaattcctaaacttacattagtaattattaagctcttttacaataggacttctatt 

aagtcttctctaatatgaaaacttcaataagaactctgccagtcttcaagtgtcatgagt 

tagttgcttctgagatagcaagtaggcatcaacaacttagagcacattctaggaggttgt 

aaaaccattaaccagtggtcttaaaaagggaactaacaataggctataggtgcaaggaca 

gaagataaaatattgactaggtttatcaatacaaaatttacccacaaaagttatgttttt 

gacttttcataaaaactctttatgaacctgtagaactggtgaaagatgacgaatgcttag 

ccagataattactcctaatagatatgcatgtgaatattctgtgctgtaaacttatttatg 

tttgaacttccagtgaacttttgtttaaaaaagggggggggttgaaaaagccatgtgatc 

tattctcctagaaagggtacagaagactaagaaagattacattggagatgtaaccttgga 

gagaaagctttgggagcaagagcatagagagcaaggccattgtggcatcagagcaggagg 
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agagagcaagattagaaggagatgcagagtggaataacttagaaactataaggcaacata 

aaaaattaagagagccatatgcagaatgcagagggaaagagaaaaaaaaaaaaaaaaaga 

agctgcagggagagcagaaggagcaggcaggcttctcctgaccatggggtagaacagggc 

ttttcttaataccaaggcaggcttagtcttaaggataataaagcttttctttcttacaga 

cttggttttaattcatttagcaataaaagtgtaaaagtgttttctttccctatgcaataa 

agattggagcttatttttcagccagaatgagtgagttctctctgcaacggtgcttggtct 

tttgcttcatatacacacataagtgtgtgtgtgcgcgcatgcgtgtgtgtgtgtgtgtgt 

gtgtgtgtgtgtgtaagtgtgcaattatcagatggcatggaagctgggctcaattggttc 

aaatggggacttgtgagggtatatgcatgaatctgtatatgaattcatgtgagcttatiat 

atatttgcttgtgtaaaagtttttccttctgtgagtgtgactctcttctcctggttcaat 

agaggtttattgcttcaaacttccccctagcctgacagtcgagaggcatctggacaagag 

agaaaaggctctagccattaatccttttcttagatccattttcttagagaactttcttag 

gaaactgtttagagagaacatagaaacaggctgaaatcacttgtcaaactgtcccctttt 

cttcctaaggacttctactagcagactgggagttagagctgcacagtccctgaggagata 

gaacaaaggctgctttactgaatcccctgctgttttaagatgaggttctaaaggagattg 

cagtttctgacccccaaaaggaactcaggcaggtcagctacagtatcaaagtgacttaaa 

cttaagatagggatatgttttattattaaacagctaccctaaatatctcataagatcaag 

cttaccccggtgacacttccccctctgttgcctcaagaggaaccaagcagaaagaaccgc 

cagggctggctcctggcacaaatgggttaaagatgttgtagcatggggaaatgaagagat 

ggctcagctattaagagaatatcttactcttccagaggaccagtgttcaattcccagcaa 

acatatcaggtgccacaccatcacttgtagctccagctgcagatctgctacatctggcct 

ccataggcacccacacacaggtggcacccacaattaaaaaaataagataaatctaaaaca 

gcaaagttaaagcatgagctgaaactagtaaagtgcttgtgtggcatagaccaagacctg 

ggtttggtccctacctgttagaaatagtctcagtatcacacaaaggaacacccaagcgaa 

gcaaaagctcccagcaagacaaaactacagtcttcattgagagtgtgcacgctgaagacc 

gagcacactgggtgcaaaatgtacttggattctgtttgcttgttttgttccagacagggt 

ttctctgcataacagccttggctgtccctgaaattcactgtgtagaccaggctggcctca 

aaccctcagagatccgttcacccacgcttatctaggcttcagtctcaccctgtgagatgg 

cctgaaagttgttagaaccgcgcgggatctatttctgacagactggctggcatcttttcc 

ttctctcagcatgagattcctggggcgttcccatttcagcatcaagcatggtagcagagt 

tggaacctgagggctgagggctcagactcagaccataaactggaagcagagagaactgga 

gattgtgggaggctttgaaacctcagctcctgccccagcaaataccttccagcaaggcca 

cacctcttaaacctccccaaacagggtcaccaactggggacctaatattcaaatgcccac 

aaatatgggagacatgacatccaaaccgccaggacaggtgtatacctccatgcttggttt 

ccgtagtaagaaacactaaacattagcctttcctaataaacactgatataaagccctgct 

attctcgatgtttttcctctgttctgctcctccttctccacctgcttctctgttctctga 

cctcttctgtgtcacagatagccctgccatgtccatctgccagccatgttctgt'ctactt 

gcctctctctctgctctggactcttctagatgcctctggctgttctttctcatatctaca 
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>mouse Gene 216_cDNA 

AAAGGCACTCCCAGCCAT GGGCT CGAGGTGCGGGAGACCCGGGGGGTCT CCG 

GTGCTGCTATTGCTGCCGCTGTTGCTGCCCTCGTGTCCGCTGCGGAGCGCTCG 

GAT GTTTC CAG GAAATGC C CATGGAGAGCTAGT CACT CCCCACTGGATCCT GG 

AGGGCAGACTCTGGCTCAAGGTCACCCTGGAGGAGCCGATCTTGAAGCCTGAC 

TCGGTGCTGGTGGCTTTAGAGGCTGAAGGCCAGGATCTCCTGCTTGAACTGGA 

GAAGAAGCACAAGCTTC TGGCCC CAG GAT ACACAGAAAC CCACT AC AG GCCAG 

ATGGGCATCCGGTAGTGCT GTCC CCCAACCACACGGAT CATT GCCAATATCAC 

GGGCGTGTGAGGGGCTTCCGGGAATCCTGGGTGGTTCTCAGCACCTGCTCTG 

GGATGAGTGGCCTTATTGT GCTCAGCAGCAAAGTCAGCTATTAT CTGCAACCTC 

GGACTCCTGGGGATACCAAAGACTTCCCAACCCACGAGATCTTCCGGATGGAG 

CAGTTGTTCACCTGGAGAGGGGTCCAGAGAGACAAGAACTCCCAATACAAAGC 

AGGAAT GGCCAGT CTTC CTCAT GTCCCCCAGAGCCGGGT GAGGCGAGAGGCG 

CGCAGGAGTCCC AGGTACCT GGAACTGTACATAGTGGCT GACCACACCCT GAA 

CTT GAACGACACGAGACAGCGCCTCCT GGAGGTTGCCAATTGCGTGGACCAGA 

TTCT CAGGACTCT GGATATACAGTTGGT GTTGACCGGGCT GGAAGTGT GGACC 

GAGCAGGATCTCAGTCGCATCACTCAGGACGCAAACGAAA.CGCTCTGGGCTTT 

CCTACAGT GGCGCCGC GGGGTGTGGGCCAGGAGACCACACGACTCCACACAA 

CTGCTCACGGGCCGCACCTTCCAGGGTACCACGGTGGGCCTGGCACCTGTGG 

AGGGCATATGCCGCGCGGAGAGCTCCGGAGGTGTGAGCACAGACCACTCGGA 

ACTCCCCATCGGCACAGCAGCCACCATGGCCCACGAGATAGGCCACAGCCTGG 

GCCTCCACCATGATCCCGAGGGCTGCTGCGTGCAGGCCGATGCAGAGCAAGG 

AGGCTGCGTCATGGAGGCAGCCACAGGGCACCCTTTCCCGCGCGTCTTCAGCG 

CCT GCAGCCGCC GCCAGCTGCGCACCTT CTTCCGCAAAGGGGGCGGT CCTT GC 

CTCTCCAACACCTCGGCGCCGGGGCTCCTGGTGCTGCCCAGCCGCTGCGGAAA 

CGGCTTCTTGGAAGCAGGAGAAGAGTGCGACTGCGGTTCTGGCCAGAAGTGC 

CCGGACCCCTGCTGCTTTGCCCACAATTGCTCCCTGCGTGCGGGGGCTCAATG 

TGCCCACGGTGATTGCTGTGCGAAGTGCCTGTTAAAGTCCGCGGGCACGCCTT 

GTCGTCCTGCTGCGACTGACTGCGATCTCCCCGAGTTCTGCACCGGCACCTCC 

CCGTATTGCCCCGCAGATGTTTACCTACTGGATGGCTCACCCTGCGCTGAGGG 

T CGCGGCTATTGC CTAGACGGCT GGT GT CCCACGCTGGAGCAGCAGTGCCAGC 

AGCTATGGGGGCCTGGGTCCAAGCCGGCCCCAGAGCCATGTTTCCAGCAGAT 

GAACTCCAT GGGGAATT CGCAAGGGAACT GTGGCCAGGACCACAAGGGTAGC 

TTC CT GCCTTGTGCT CAGAGGGACGCTCT GT GT GGGAAACTGCTGTGCCAGGG 

AGGGGAGCCGAACCCACTAGT GCCGCACATAGTGACTATGGACTCCACAATT C 

TCCTAGAGGGCCGCGAAGTGGTTTGCCGAGGGGCCTTTGTGCTCCCAGATAGT 

CACCTGGACCAGCTTGACTTGGGTCTGGTAGAGCCAGGCACCGGCTGTGGACC 

TAGAATGGTGTGCCAGGACAGGCACTGTCAGAATGCTACCTCCCAGGAGCTGG 

AACGTTGCTTGACTGCCTGCCATAACGGTGGGGTTTGCAATAGCAATCGTAACT 

GTCACTGTGCTGCTGGCTGGGCTCCACCCTTCTGTGACAAGCCTGGCTTGGGT 

GGTAGCGTGGATAGTGGCCCTGCACAGTCTGCAAACCGAGATGCCTTCCCCTT 

GGCCATGCTCCTCAGCTTCCTGCTGCCTCTGCTCCCTGGGGCTGGCCTAGCCT 

GGT GCT ACTACCAGCT C CCAACATTCTGTCATCGAAGGGGACTGTGCT GCAGG 

AGGGACCCCCTAT GGAATAGAGACATACCCCTGGGCAGTGTGCATCCGGT GGA 

GTTTGGCTCCATCATCACTGGAGAGCCCTCGCCCCCTCCCCCATGGACCTCTTG 

CCAACAGCGTTCGCACCCTCCAT CTCTTGACTTGCTCTCAGACCCT GCGAACTC 

T GAGCTT ACCTAAGAACTACCCT CTGAAGCAGCCT GGTCTACAGATTGAGTTCC 

AGACCTGCCCTATCCCTATGGTATGGAAGCACCCTGAGGACCTCCTGTTGCCCA 

GTCACCTACCTCTGTCTCAGTTTGTTGTCCCCTCCTCAGATTTACAGGCTTGCAT 

CAATAAAGAAAT GAGACAT GGGCCT CAGAGAAS CT GTTGTCATAGAGACCATG 

ATGCTGGAACCCCTAGGGGCAGGGAAGGGAGACACTGTGGTTCTTCTTGGGTC 

CT TATAGAGGGAGGACAAATGT GCCCTGCCAT GTGACTTGCAGTCCTCAGTTTC 

TCAGACGCACTCTTATAATTCCTATGGGCT GTATGCTGAGCT CT TACT CAGCATA 

GGAACCCCAGAGCCCGATCATGTTGTATCCC S CCTGCCCTGAGAGCTGT GCTAT 

TCTGAAATGTTAGAATGTATCTAATAACAATAAATCCACAAGTTATATCAGEIAAA 

AAAAAAAA 
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ELEKKHKLLAPGYTETHYRPDGHP>^ 
DFPTireiFRMEQLFTWRGVQRDKNSQYKftGMA^ 

oIlrtldiqlv^tglewteqdlsritqd^^ 

GGVSTDHSELP I GTAATMAHEI GHSLGLHHD PEGCCVQADAEQGGCVMEAATGHP FPRVFSACS RRQLRTFFRKGGGPCL 
SNTSAP^LLVLPSRCGNGFLEAGEECD^^ 

EFCTGTSPYCPADVYLLDGSPCAEGRGYCLDGWCPTLEQQCQQLWGPGSKPAPEPCFQQMNSMGNSQGNCGQDHKGSFLP 
CAQRDALCGKLLCQGGEPNPLVPHIVTMDSTILLEGREWCRGAFVLPDSHLDQLDLGLVEPGTGCGPRMVCQDP^CQNA 

T^QELERCLTACHNGGVCNSNRNCHGAAG 

QLPTFCHRRGLCCRRDPLWNRDIPLGSVHPVEFGSIITGEPSPPPPWTSCQQRSHPPSLDLLSDPANSELT , 
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hSene216 1 MGWRPRRARGTPl.Il.LL . LLLLLWPVPGAGVLQGHIPGQPVTPHWVLDGQ 49 

— 51 TTi i i" l mm i it 100 

hGene216 50 PWRTVSLEEPVSKPDMGLVALEAEGQELLLELEKNHRLLAPGYIETHYGP 99 

— 101 Ti H im!i!ii"- : tTTTTTTTTT^iTm s tTTTT 137 

hGene216 100 DGQPVVLAPNHT VRCFHGLWD AP PEDHCHYQGRVRGFPD SWWLCTCSGM 149 
mGene216 138 SGLIVLSSKVSYYLQPRTPGDTKDPPTHEIPRlffi 187 
hGene216 150 iiii T LiRNASYYLRiwPPRGSKDFSTHEIFRMEQLLTWKGTOGHRDPGN 199 

— 6 188 tt! TuTWn\ m^iiiu il m It ™ 231 

hGene216 200 l^GJffTSLPGGPQSRGRREA 249 
hGene216 250 ii E VANY^QLLRTLDIQVALTGLEVWTERDRSRVTQDANATLWAFLQWR 299 

— 216 282 inHimi^ 2 

hGene216 300 RGLWAQRPhAsaQLLTGRAFQGATVGLAPVEGMCRAESSGGV 349 

— 216 332 TTTmrrr i t ? i nrrr n niTrmir n i ? ? i ii t 11 1 381 

hGene216 350 IGAAATMAHE IGHSMLSHDPDGX3CVE AAAESGGCVMAAATGHPFPRVFS 399 
mGene216 382 ACSRRQLRTFFRKGGGPCLSNT SAPGLLVLPSRCGNGFLEAGEECD 'j'**^ 431 

II I II I II IMI III I II I I- 1 1 Ill-Ill nil II I _ 

hGene216 400 ACSRRQLRAFFRKGGGACLSNAPDPGLPVPPALCGNGFVEAGEECDCGPG 449 

hGeneaiS 450 QECRDLCCFAHNCSLRPGAQCAHGDCCVRCLLKPAGALCRQAMGDCDLPE 499 
mGen e 216 482 PCTGTSPyCPADVYLLDGSPCAEGRGYCLDGWCPTLE^MGQQLWGPGSKP 531 

I I III I =11 III I I ID II II I I III II III I III I 

hGene216 500 FCTGTSSHCPPDVYLLDGSPCARGSGyCWDGACPTLEQQCQQLWGPGSHP 549 

mGene216 532 APEPCrQQMHSMGNS^H^DHKGSFLP^QTOALCG^LC^GEPOT>L 581 

iii 111 • ■• in -i urn iiiiiiii iiii-i- J KOQ 

hGene216 550 APEACFQWNSAGDAHGNCGQDSEGHPLPCAGRDALCGKLQCQGGKPSLL 599 
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hGen e 216 600 AphMVPVDS^ 649 ' 

— 632 trrr~tu HTr^TmMnTrTrTTTTTim^ 681 

hGene216 650 CQSRRCRKNAFQELQRCLTACHSHGVCNSNHNCHCAPGWAPPFCDKPGFG 699 
„,Gene216 682 GSVDSGPAQSANRDAFPLA^^ "0 
hGene216 700 GSMDSGPVQAENHDTFLLAMLLSVLLPLLPGAGLAWCC5fRIjPGAHLQRCS 749 

mGene216 731 LCCRRDPLW i^IPLGSVHPVEFGSIITGEPSPPPPWTSCQQR 773 

hGene216 750 WGCRRDP AC SGPKD6PHRDHPL6GVHPMELGP T ATGQPWPLD PE N S HEP S 799 

mGene216 774 SHP. . PSLDLLSDPANSELT 791 

hGene216 800 SHPEKPLPAVSPDPQADQVQMPRSCLW 826 
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10 30 50 

CGGGCACGGGTCGGCCGCAATCCAGCCTGGGCGGAGCCGGAGTTGCGAGCCGCTGCCTAG 
+ + + + + + 

70 90 110 

AGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCGCGGAGAGCTCGGGGGACCCCGTTGC 

+ + + + + + 

MetGlyTrpArgProArgArgAlaArgGlyThrProLeuL 

130 150 . 170 

TGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGAC 

+ + + + + + 

eulieuLeuljeuLeuLeuLeuLeuLeuTrpProValFroGlyAlaGlyValLeuGlnGlyH 

190 210 230 

ATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACCG 

+ + + + + + 

isIleProGlyGlnProValThrProHisTrpValLeuAspGlyGlnProTrpArgThrV 

250 270 290 

TC AGC CTGGAGG AGC CGGT C TCG AAGC C AGAC ATGGGGCTGGTGGCCCTGG AGGC TGAAG 

+ + + + + - + 

alSerlieuGluGluProValSerLysProAspMetGlyLeuValAlaLeuGluAlaGluG 

310 330 350 

GCCAGGAGCTCCTGCTTGAGCTGGAGAAGAACCACAGGCTGCTGGCCCCAGGATACATAG 
+ + + + + + 

lyGlnGluLeuLeulieuGluLeuGluLysAsiiHisArgLeuLeiaAlaProGlyTyrlleG 

370 390 410 

AAACCCACTACGGCCCAGATGGGCAGCCAGTGGTGCTGGCCCCCAACCACACGGTGAGAT 

+ + + + + + 

luThrHisTyrGlyProAspGlyGlnProValValLeuAlaProAsnHisThrValArgC 

430 450 470 

GCTTC C ATGGG CTC TGGGATGC AC CGC CAGAGGATCATTGC CACTACC AAGGGCGAGTAA 

+ + + + + + 

ysPheHisGlyLeuTrpAspAlaProProGluAspHisCysHisTyrGlnGlyArgValA 
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490 510 530 

GGGGCTTCCCCGACTCCTGGGTAGTCCTCTGCACCTGCTCTGGGATGAGTGGCCTGATCA 

+ + + + -t- + 

rgGlyPheProAspSerTrpValValLeuCysThrCysSerGlyMetSerGlyLeuIleT 

550 570 590 

CCCTCAGCAGGAATGCCAGCTATTATCTGCGTCCCTGGCCACCCCGGGGCTCCAAGGACT 

+ + + + + + 

hrLeuSerArgAsnAlaSerTyrTyrLeuArgProTrpProProArgGlySerLysAspP 

610 630 650 

TCTCAACCCACGAGATCTTTCGGATGGAGCAGCTGCTCACCTGGAAAGGAACCTGTGGCC 

+ + _ + + + + 

heSerThrHisGluIlePheArgMetGluGlnLeuLeuThrTrpLysGlyThrCysGlyH 

670 690 710 

ACAGGGATCCTGGGAACAAAGCGGGCATGACCAGCCTTCCTGGTGGTCCCCAGAGCAGGG 

+ + + + + + 

isArgAspProGlyAsnLysAlaGlyMetThrSerLeuProGlyGlyProGlnSerArgG 

730 750 770 

GCAGGC GAGAAGCGC GC AGG ACCCGG AAGTACCTGG AAC TGTAC ATTGTGGC AGAC C ACA 

+ + + + + + 

lyArgArgGluAlaArgArgThrArgLysTyrLeuGluLeuTyrlleValAlaAspHisT 

790 810 830 

CCCTGTTCTTGACTCGGCACCGAAACTTGAACCACACCAAACAGCGTCTCCTGGAAGTCG 

+ + + + + + 

hrLeuPheLeuThrArgHi sArgAsnLeuAsnHi sThrLysGlnArgLeuLeuGluValA 

850 870 890 

CCAACTACGTGGACCAGCTTCTCAGGACTCTGGACATTCAGGTGGCGCTGACCGGCCTGG 

+ + + + + + 

laAsnTyrValAspGlnLeuLeuArgThrLeuAspIleGlnValAlaLeuThrGlyLeuG 

910 930 950 

AGGTGTGGAC CGAGCGGGACCGCAGCCGCGT.C ACGCAGGACGCCAACGC C AC GCTCTGGG 

+ + + + + + 

luValTrpThrGluArgAspArgSerArgValThrGlnAspAlaAsnAlaThrLeuTrpA 

970 990 .1010 
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CCTTCCTGCAGTGGCGCCGGGGGCTGTGGGCGCAGCGGCCCCACGACTCCGCGCAGCTGC 

+ + + + + + 

laPheLeuGlnTrpArgArgGlyLeuTrpAl aGlnArgProHi s AspSerAl aGlnLeuL 

1030 1050 1070 

TCACGGGCCGCGCCTTCCAGGGCGCCACAGTGGGCCTGGCGCCCGTCGAGGGCATGTGCG 

+ + + + + 

euThrGlyArgAlaPheGlnGlyAlaThrValGlylieuAlaProValGluGlyMetCysA 

1090 1110 H30 

GCGCCGAGAGCTCGGGAGGCGTGAGCACGGACCACTCGGAGCTCCCCATCGGCGCCGCAG 

+ + + + + 

rgAlaGluSerSerGlyGlyValSerThrAspHisSerGluLeuProIleGlyAlaAlaA 

1150 1170 H90 

CCACCATGGCCCATGAGATCGGCCACAGCCTCGGCCTCAGCCACGACCCCGACGGCTGCT 

+ + + + + + 

laThrMetAlaHisGluIleGlyHisSerLeuGlyLeuSerHisAspProAspGlyCysC 

1210 1230 1250 

GCGTGGAGGCTGCGGCCGAGTCCGGAGGCTGCGTCATGGCTGCGGCCACCGGGCACCCGT 

. + + _+ + + 

yS ValGluAlaAlaAlaGluSerGlyGlyCysValMetAlaAlaAlaThrGlyHisProP 

1270 1290 1310 

TTCCGCGCGTGTTCAGCGCCTGCAGCCGCCGCCAGCTGCGCGCCTTCTTCCGCAAGGGGG 

+ „ + + + + + 

heProArgValPheSerAlaCysSerArgArgGlnLeuArgAlaPhePheArgLysGlyG 

1330 1350 1370 

GCGGCGCTTGCCTCTCCAATGCCCCGGACCCCGGACTCCCGGTGCCGCCGGCGCTCTGCG 

+ + + + + + 

lyGlyAlaCysLeuSerAsnAlaProAspProGlyLeuProValProProAlaLeuCysG 

1390 1410 1430 

GGAACGGCTTCGTGGAAGCGGGCGAGGAGTGTGACTGCGGCCCTGGCCAGGAGTGCCGCG 

+ + + + + + 

lyAsnGlyPheValGluAlaGlyGluGluCysAspCysGlyProGlyGlnGluCysArgA 

1450 1470 1490 
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ACCTC TGCTG C TTTGCTC AC AACTG CTCGCTGCGC CC GGGGG C C C AGTGCGCC C ACGGGG 

+ + + + + + 

spLeuCysCysPheAlaHisAsnCysSerLeuArgProGlyAlaGlnCysAlaHisGlyA 

1510 1530 1550 

ACTGCTGCGTGCGCTGCCTGCTGAAGCCGGCTGGAGCGCTGTGCCGCCAGGCCATGGGTG 

. + x + + + + 

spCysCysValArgCysLeuLeuLysProAlaGlyAlaLeuCysArgGlnAlaMetGlyA 

1570 1590 1610 

ACTGTGACCTCCCTGAGTTTTGCACGGGCACCTGCTCCCACTGTCCCCCAGACGTTTACC 

. + + + + + 

spCysAspLeuProGluPheCysThrGlyThrSerSerHisCysProProAspValTyrL 

1630 1650 1670 

TACTGGACGGCTCACCCTGTGCCAGGGGCAGTGGCTACTGCTGGGATGGCGCATGTCCCA 

+ + + + + + 

euLeuAspGlySerProCysAlaArgGlySerGlyTyrCysTrpAspGlyAlaCysProT 

1690 1710 1730 

CGCTGGAGCAGCAGTGCCAGCAGCTCTGGGGGGCTGGCTCCCACCCAGCTCCCGAGGCCT 

+ + + + ■ + + 

hrLeuGluGlnGlnCysGlnGlnLeuTrpGlyProGlySerHisProAlaProGluAlaC 

1750 1770 1790 

GTTTCCAGGTGGTGAACTCTGCGGGAGATGCTCATGGAAACTGCGGCCAGGACAGCGAGG 

+ + + + + + 

ysPheGlnValValAsnSerAlaGlyAspAlaHisGlyAsnCysGlyGlnAspSerGluG 

1810 1830 1850 

GCCACTTCCTGCCCTGTGCAGGGAGGGATGCCCTGTGTGGGAAGCTGCAGTGCCAGGGTG 

+ + + + + + 

lyHisPheLeuProCysAlaGlyArgAspAlaLeuCysGlyLysLeuGlnCysGlnGlyG 

1870 1890 1910 

GAAAGCCCAGCCTGCTCGCACCGCACATGGTGCCAGTGGACTCTACCGTTCACCTAGATG 

+ + + + + + 

lyLysProSerLeuLeuAlaProHisMetValProValAspSerThrValHisLeioAspG 

1930 1950 1970 

GCCAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCTGGACCTGCTTG 
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lyGlnGluValThr-CysArgGlyAlaLeuAlaLeuProSerAlaGlnLeuAspLeuLeuG 

1990 2010 2030 

GCCTGGGGCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGTGCCAGAGCAGGC 

+ + + + + + 

lyLeuGlyLeuValGluProGlyThrGlnCysGlyProArgMetValCysGlnSerArgA 

2050 2070 2090 

GCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTGCCACAGCCACG 

+ + + + - + + 

rgCysArgLysAsnAlaPheGlnGluLeuGlnArgCysLeuThrAlaCysHisSerHisG 

2110 2130 2150 

GGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGCTCCACCCTTCTGTG 

+ + + + + + 

lyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaProProPheCysA 

2170 2190 2210 

ACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGAAAACCATGACA 

+ + + +--- + + 

spLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGluAsnHisAspT 

2230 2250 2270 

CCTTCCTGCTGGCCATGCTCCTCAGCGTCCTGCTGCCTCTGCTCCCAGGGGCCGGCCTGG 

+ + + + + + 

hrPhelieuLeuAlaMetLeuLeuSeryaiLeuLeuProLeuLeuProGlyAlaGlyljeuA 

2290 2310 2330 

CCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTGGGGCTGCAGAA 

+ + + + + + 

laTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTrpGlyCysArgA 

2350 2370 2390 

GGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCAC^CCCCTGGGCGGCG 

+ + + + + + 

rgAspProAlaCysSerGlyProLysAspGlyProHisArgAspHisProLeuGlyGlyV 

2410 2430 2450 

TTCACCCCATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCTGGACCCTGAGA 
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alHisProMetGluLeuGlyProThrAlaTlirGlyGlnProTrpProLeuAspProGluA 

2470 2490 2510 

ACTCTCATGAGCCCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTCGCCTGACCCCC 

+ + + + + 

snSerHisGluProSerSerHisProGluLysProLeuProAlaValSerProAspProG 

2530 2550 2570 

AAGCAGATCAAGTCCAGATGCCAAGATCCTGCCTCTGGTGAGAGGTAGCTCCTAAAATGA 
InAlaAEpGlnValGlnMetProArgSerCysLeuTrpEnd 

2590 2610 2630 

AC AGATTTAAAGAC AGGTGGCCACTGAC AG C CACTCCAGG AACTTG AACTGC AGGGGC AG 
+ + + + + 

2650 2670 2690 

AGCCAGTGAATCACCGGACCTCCAGCACCTGCAGGCAGCTTGGAAGTTTCTTCCCCGAGT 
+ + + + + + 

2710 2730 2750 

GGAGCTTCGACCCACCCACTCCAGGAACCCAGAGCCACATTAGAAGTTCCTGAGGGCTGG 

2770 2790 2810 

AGAACACTGCTGGGCACACTCTCCAGCTCAATAAACCATCAGTCCCAGAAGCAAAGGTCA 
+ + __, + + + 

2830 2850 2870 

CACAGCCCCTGACCTCCCTCACCAGTGGAGGCTGGGTAGTGCTGGCCATCCCAAAAGGGC 
+ + + + + 

2890 2910 2930 



2950 2970 2990 

GGGCATG ACTGCTGGCCAGAAGCTAGTGGTCCTGGGGC C C TATGGTTCGACTGAGTC C AC 
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3010 3030 3050 

ACTCCCCTGCAGCCTGGCTGGCCTCTGCAAACAAACATAATTTTGGGGACCTTCCTTCCT 
+ + + + + + 

3070 3090 3110 

GTTTCTTCCCACCCTGTCTTCTCCCCTAGGTGGTTCCTGAGCCCCCACCCCCAATCCCAG 
+ + + + + + 

3130 3150 3170 

TGCTACACCTGAGGTTCTGGAGCTCAGAATCTGACAGCCTCTCCCCCATTCTGTGTGTGT 
+ + + + + + 

3190 3210 3230 

CGGGGGGACAGAGGGAACCATTTAAGAAAAGATACCAAAGTAGAAGTCAAAAGAAAGACA 
+ + + + - + + 

3250 3270 3290 

* 

TGTTGGCTATAGGCGTGGTGGCTCATGCCTATAATCCCAGCACTTTGGGAAGCCGGGGTA 
+ + + + + + 

3310 3330 3350 

GGAGGATCACCAGAGGCCAGCAGGTCCACACCAGCCTGGGCAACACAGCAAGACACCGCA 
+ + + + + + 

3370 3390 3410 

TCTACAGAAAAATTTTAAAATTAGCTGGGCGTGGTGGTGTGTACCTGTAGGCCTAGCTGC 
+ + + + + + 

3430 3450 3470 

TCAGGAGGCTGAAGCAGGAGGATCACTTGAGCCTGAGTTCAACACTGCAGTGAGCTATGG 
+ + + + + + 

3490 3510 3530 

TGGCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAATAAATTTr 
+ + + + + + 

3550 3570 3590 
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AAAAAGACATAAAAAAAAAAAAAAA^^ 

. + + + + + 



3610 

AAAAAAAAAAAAAAAAAAAAAAAAAA 
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10 30 50 

CGCAACTGGAGTGAGTCTCTTTCTTGCAACCCAAACAGCTGGGCTAGAACAGAGGAGTCA 
+ + + + + + 

70 90 110 

CATGTTAGTGGGGTTTATTCTTAGACACCATCTGTAGGAGGGGGCCAGGGGCTGCAAGTT 
+ + + + + + 

130 ISO 170 

ACATGACATTTATCACAAATTACCTTACGCTGAAGTGATTTGTGTCCCTTTCTGCTCCTT 
__ + + + + + + 

190 210 230 

GAGGCTTCGTGTGTTGCTCACATTTGCCACCCACAATGACCAGCCCCGTGCATGGCACGC 
+ + + + + + 

250 270 290 

AGTAGGTGGGCACGTGAACTAAGCGTTTGTCTTGTCTACATTGCTGACAGTGGGGAGGGT 
+ + + + + + 

310 330 350 

GGAG AAC ATGGACTAAATC C AGAGGTGGGACAGGGAGAGAGC AAGTCAAATGCAGC C AGA 
+ + + + + + 

370 390 410 

TGTTTGGGGGCTGTCTGGGGAAGAGAGGGTGTAGGGAGGACCATAAGCAGCTGCTGAAGC 
+ + + + + + 

430 450 470 

TGCGGCTACTGGGGGGATTTGGGAATATCCGGCTTAGGAGTCTTCACCTGCTTCTGCATA 
+ + + + + + 

490 510 530 

CTTCCAAGGGGACATATGTCCATCTTGGATTGGAGGCTCTCAGAGGGGTTAGTTGTGTCC 
+ + + + + + 

550 570 590 
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GCTTCCAGTGGTCTGAGGCCTCCTGAGACTTTTTGTACATTCAATTAAAGTGTAACATAC 



610 630 650 

AGAAAAGTTCACACATTATAAATGCAACTCAAGGACTTTTCCAAAAGCGAACACACCCAG 



670 690 710 

ATC AAGAAATAG ACCATC CTAC AGTC CC CC CTTAC ACTCTGT ACC AGTTGCAGC C CCCAC 



730 750 770 

AAGGGTAACTACTGTCTTGACTTCGAACACCATAGATTTGATTTGCCTGTTTTTAAACTT 



790 810 830 

TACATAAGTAGAATCACAGAGTGTGTACAATGACTTTGGAAAACTGTTTGACAATATCTA 



850 870 890 

TTAAAGCTAAAATACCCTTGCCCTATGAACCTGAAATTCCACCCACCTTGCCAAGGGACA 



910 930 950 

AAAAGTTCCCCTCTAAATGCACCAGGCTGTCAGGGATGAAGCGTTGGCTTTGGGGCCCCC 



970 990 1010 

ATTCACACACATGACCTTTTCTGGGGCACCCAAGCATCAGCCTGTCGTCACCAGGTGCCA 



1030 1050 1070 

CCCTGGCGATCTCTGAAGGCTGGAGTCGGAGTGCCTCCCTCAGACATCCTGTTCTGCGTC 



1090 1110 1130 

ACTCCTTGGGAGAAGTCGTGTTTACAGATGGTGGGTGTCACCCATGCCAAGCACTTCTAA 
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1150 1170 1190 

GGGTTAATGCTCACTGGTTTGCCTGGTTCCCAGGACATTTCCTGATGCCCCTCTGGAGGG 



1210 1230 1250 

TGACGCCAACAAGCCAGTGGAGAAGCCATCTTTCCCAGGTGCTGTCAGGCGCCCCCGGAG 



1270 1290 1310 

CTGCTCGGTGCATCCTAGGATCCCTCTTCCTCAGCTTTGGTTTGATGGCCTCATCTCCTC 
+ + + + + + 



1330 1350 1370 

CCCTGCAACCTCAAAATGTAAATAAACCCTTTCTCAGAGACTTCGGCAGAAAATTCCTCT 
+ + + + + + 



1390 1410 1430 

GACCTGCACTTGGACACAGCTCATCTGGGTTTGGGAGGTGTCAACTGTGTAAGGATGACT 
+ + + + + + 



1450- 1470 1490 

CTGATCCCCATGTGGCTTTTCGACTGTGTCCCCTCTACAGTCAGTTATTAGCACTGACTG 
+ + + -- + + + 



1510 1530 1550 

TGCTAGGAAGTGAGCAACACACATATTCCCAGACCACATGGAGCTCAGGAGCTTGGGGAG 



1570 1590 1610 

AGAGACAGGGAAGTGGACGACTACAGGGCCTTCTGAAACGTGTTGCAGGGAGAAGTGTCA 
+ + + + + + 



1630 1650 1670 

GTCAGGGGATGCTAACCTGGCTTTGGGTAAGGGACAGCCTCTGAATGACAGGACATTAAA 
+ + + + + + 



1690 1710 1730 
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GCCATGGCCTGCAGTTTAAGTAGGAGTTGGCCAGTTCGAGGTAAGAATACCAGTAAGCAA 
+ + + + + + 



1750 1770 1790 

GAACGCCAGAGTAGCTCCTCGAGCTGCCTTCTGTACCTGACATCCACACTGAAGCCAGCC 
+ + ~ + + + + 



1810 1830 1850 

CCTCTGTGTTCAGCCTTGCTTTACTGAAGAGGTGTCGCTGAGGGGCTGCTCTGGGGTGCT 



1870 1890 1910 



1930 1950 1970 

GGTTCTCAGGCCTTTGAACTCAAACTGAATCACACCACTGGCTTTCCTCGTTCTCCAGCT 



1990 2010 2030 

TGCAGATGGCAGATTCGGGAACTTTTTGGCCTCCATAATCACGTGAGCCAATTGCTATAA 
, + + + + + + 



2050 2070 2090 



2110 2130 2150 

T AAG AGC CC CTAACTGGAAAATAACCCTATGGTGC ACTGGTGAGTAGAGAAACTGTGGTT 



2170 2190 2210 

CCCTCAAACCACCGAACACTATTCAGCAATACGAAGGAACAAACTATTGATATGCAAAAT 



2230 2250 2270 

AGTGTAAATGAATCTCAAAAACATCGGAAAGAGGGAAGGAAGCCAGACACAGAAGAGTGC 
+ + + + + + 
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2290 2310 2330 

ATGCCGCATGATTCCATTTATATGAAATTCTAGAACAGGCAAAACTTATCTATAGACAGA 
+ + + + + + 

2350 2370 2390 

GAACAACAGATCAGTGGCTGTCTGGGGTTGGGAGTGGGGAAGTTTGGCTGGAAGGGCACA 
+ + +--• -- + + + 

2410 2430 2450 

GGGCTCTTTCTGTGAGTGAGGGAATGTGTCTGCATTATAGTGATGCTTATGTAGTTATAT 
+ + + + + + 

2470 2490 2510 

ACACTTATCGAAACTCATCTTACTGGCCACTTAAAATAAGTGCATTTTATTGTGTGTAAA 
+ + + + + + 

2530 2550 2570 

TTATAC CTTAATGAAGTTGATTTGAAAATC C AAAGT AGTAATAAT AAGT AATAAT CTCGT 
+ + + + '- + + 

2590 2610 2630 

AGC TGG AC AGCTGTGGTGACTC ACTC CTGTAATTC C AGCG ATTTGAGAAGC TGAGGCAGG 
- _ + + + + + + 

2650 2670 2690 

AGGATCACTTAAGATCAGGAGTTCTTTTTATTTTTATTTTTATTTTTTGAGACGGAGTTT 
+ + + + + + 

2710 2730 2750 

CGCTCTTGTTGCCCAGGCTGGAGTGCAATGGCATGATCTCGGCTCGCTGCAACGTCCACG 
+ + + + + + 

2770 2790 2810 

TTCTGAGTTCAAGCGATTTTCCTGCCTCAGCCTCCCAAGTAGCTGGAACTACAGGCGCTC 



2850 
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ACCACCATGCCCGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGCC 



2890 2910 2930 

AGACTGGTCTTGAACTCCTGACCTCCAGTGATCTGCCCGCCTCGGCCTCCCAAAGTGCTG 



2950 2970 2990 

GGATTACAGGCATGAGACACTGCGCCTGGCCAAGACCAGGAGTTTGAGACCAGCCTGGGA 



3010 3030 3050 

AACAAAGTGAGACCCCCTGTCTACAGAAAAATTAAAAATTTTAGCTGGGCCTGGTGCCGT 
+ + + + + + 



3070 3090 3110 

GTGCCTGTAGTTCCAGCTACTCAGGAGGCTGAGGTGGGAGGATACCTTGAGCCCAGGATT 
+ + +, + + + 



3130 3150 3170 

TCAAGGCTGCAATGAGGCATGATCAGGCCACTGTCCTCTAGCGTGGGTGACAGAGTGAGA 



3190 3210 3230 

CCCTGTCTCTAAATAATAATCATAAGAACAACAAGGACCCTCTAAACGCACTGATATCTA 
4 + h + + + 



3250 3270 3290 

AGGTGTATTAAGCGACCAAAAAAAAAAAGAAAATCAAAGTGCAGAAAAACGTTAATAAGA 



3310 3330 3350 

GAAAAAAATATGTCTGTATTGTCTTGAGTGTGAAAAAATAATCTAAAAGCCTATGAAAGA 



3370 3390 3410 

ACTTTTCCCTA 
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3430 3450 3470 

TGCTTTCTGTACTTTTTGATTTTGAGATATGTGAATGTAGGTTTCTCTCACTGCTCGAAC 
+ + + + + + 



3490 3510 3530 

TTTCACTAACCAAATTACTACATTCCAAATTCTCAAAAACAAATAGATTTACTTAAAAGT 



3550 3570 3590 

AGGCTGGGTGCGGTGTCTCACGCCTGTAATTCCAGCGCTTTGGGAGGCCGAGGCGGGCAG 



3610 3630 3650 

ATCACCTGAGGTCGGG AGTTCGAGAC CAGCCTGACCAACATGGAGAAAC CCCATCTCTAC 
+ + + + + + 



3670 3690 3710 

TAAAAATACAAAATTAGCCAGGCGTGGTGGCGAATGCCTGTAATGCCAGCTACTCGGGAG 
+ + + + + + 

3730 3750 3770 

GCTGAGGCAGAAGAATCACTTGAATCTGGGAGGCAGAGGTTGCAGTGAGCCCAGATCATG 
+ + + + + + 

3790 3810 3830 

CCATTGCACTCCAGTCTGGGTAACAAGAGAGAAACTCTGTCTCAAAAAAAAAAAAAAAAA 
+ + + + + + 

3850 3870 3890 

AAAAGATTTGCTTAAAAGTTAACATCTCCGGCCGGGCGCGGTGGCTCATGCCTGTAATCC 
+ + + + + + 

3910 3930 3950 

CAGCGCTTTGAGAGGCCGAGGCGGGTGGATCACGAGATCAGGAGATTGAGACCATCCTGG 



3990 4010 
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CCAAAATGGTGAAACCTCGTCTCTGCTAAAAATACAAAAGTTAGCTGGGGGTGGTAGCGC 



4030 4050 4070 

GCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCAGGGAGT 

+ + + + + + 



4090 4110 4130 

CGGAGGTTGCAGTGAGC C AAGAT CGCGCCGCTGCACTCC AGCCTGGCGAC AGAGGGAGAC 



4150 4170 4190 

TCCATC TC AAAAAAAAAAAAAAAAAAAAAGTTAACATC TCATC CAAATTTGCACCGAGTA 
+ + + + + + 



4210 4230 4250 

GG AAAACAAAAGTTTAAAACATGAAACAGATGTT ACTG AGGCCG AAAGGGTCTC C C AGGC 
+ + + + + - + 



4270 4290 4310 

CTGGGAGTCTGCAGCTTTTATGCAATTCTGCCCTCTGGCCACCGCCAGGGAAGAAAGGTT 



4330 4350 4370 

GTCTCCGTCTGCTGCATCGCCTTTGCCCAGCAATGAAGCCCCCAAGACAGCGGCAGCCGG 



4390 4410 4430 

TTGCCTGAACCTTCCTATCCTTGGGGGCACCCAGTGCAGGTGGATGACCCGACTCAACCT 



4450 4470 4490 

CCGCCAGGGCACCCTCGGGGCAGGACGGGTAGCAAGGAGGGGACAGAGATCGGCCCCAGG 



4510 4530 4550 

AGACCACGGAAGATCGCGCTCCTGGGGCCAACTTCAGCAGCGAGAGGCGGCCTTTGCCCA 
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4570 4590 4610 

CCGCCTCATCCCACCACGCCGCGGTCCTCCAAGAACCTTCCCAGCGGTT£TCTCCTCCTC 
+ + + + + + 

4630 4650 4670 

TCAGGAGTAGAGGCCCTCTGAGACCGACGGGGAGGGACGGCTCGGGCCGGTCATCCGAGG 
+ + + + + + 

4690 4710 4730 

GGCCGCACGGATTCCCTCCTCCGCCCAGCTCCACCCCCTCGAGGGGCGGCGGTCCGGGAG 
+ + + + + + 

4750 4770 4790 

TGGCGACCCGGCTCCCCCATGGCGCGCGCCGTCGGGGCCCCTGGCCAGGCTCCGAGCGGG 
+ + + + + + 

4810 4830 4850 

GTTGGCGGGGAGGGGAGGCGGGAGCGAGGGCGGGCGGTGGGAGGTGGGGGCGGGAAGGTC 
+ + + + + + 

4870 4890 4910 

CGAAGGCGGCGGCCTGAGGCTGCACCGGGCACGGGTCGGCCGCAATCCAGCCTGGGCGGA 
+ + + + + + 

4930 4950 4970 

GCCGGAGTTGCGAGCCX3CTGCCTAGAGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCC 
+ + + + + + 

4990 5010 5030 

CGGAGAGCTCGGGGGACCCCGTTGCTGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTG 
+ + + + + + 

5050 5070 5090 

CCAGGCGCCGGGGTGCTTCAAGGTGAGGACGCGGGCGGGGTGCGCCCTGAGGGGCAGGCT 
+ + + + + + 

5110 5130 5150 
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AGGCGCGGTGGTGGTGGCGGGGATGGGTTCTGCTCAGAGCTCGGGTCAGCGCGCGGAGGG 



5170 5190 5210 

TCTCACGGCCCCGGCACCATACGGCCAGTAGGTCAGGGCGTGGGGACTCTTTGGGGGGGT 
+ + + h + + 



5230 5250 5270 

CTCCGTGGGACCTGCCCAGGGACGCTCAAGTGTGCTTGGGCTGGCCCCGGGCCCGGACTT 



5290 5310 5330 

GCCCACACTGCCCGGCTGCCACTCCGCTGGCAAAGCAGAGGGCATGGCTCCCTCCCCCTC 
+ + + + + + 



5350 5370 5390 

GGGGACAGCCCAGCCCCCAGCCCCCAGCCCATAGCCGTAGCCCCCTCTGCCTGGATTCTC 
+ - + + + + -- + 



5410 5430 5450 

GCTCTCACAACCAGCTTCCATCCGCAGGCCACCGTGTGACCCGCTCCTGCTCCTCCACCC 
+ + + + + + 



5470 5490 5510 

CTTAGGACTCAGCGGGGCTCCATCCTCTAGGAAGCCCCCATGCCCAAGAGTCCCCCAGAG 
+ + + + + + 



5530 5550 5570 

TCCCTGCTTTGCTCTCAGGCTGCAGAACTAGCTGTGGCCTCCACCCTGCTCACCCCTCGT 



5590 5610 5630 

CCCTCCTCCCAGGGCAGCAGGGCAGTGTGTATGTTGTTTATATTGTTGCCTTGTTTGGTG 
+ + + + + + 



5650 5670 5690 

AGATAGAGAAGGGCCTCTCCAGATAGAAGGTGTCTGTTTAGCAGTGCTCTGGAAAGACTG 
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5710 5730 5750 

CAGCTGTCTCCTCGGGGTAACCCCTCCAAACAAAGATGTTAAGATGGGGCTGGAACAACC 
+ + + + + + 

5770 5790 5810 

TCTGCAAGCGGGTGGGAGG ATT AG CCAGTCCTGCACAGCAAGTGCCTGG CCGGG AACAGG 
+ + + + + + 

5830 5850 5870 

GAGGGCAACCAGGGAGGGGGCATGCGGGGCTGGGCTGTGCTATGCAGACTGGGCGGTGGC 
+ + ---+ + + + 

5890 5910 5930 

TTCCACAGCACTGTGTGGGGACCAAACAGGTACAGGGGCCTGGTCTGTTCTGGCCCCAGG 
+ + , + + + + 

5950 5970 5990 



-+ 



6010 6030 6050 

GGTGATGGGACCCCTGGGAGAGGGGCGTCTATGTGCCCAATACCAGCCTGGCTCCCTCGG 
+ + + + + + 



6070 6090 6110 

GTTCCACCCCCATTCACCCGGTCACCGGAGCTCCAGCTCCAGCTCCAGCTCTGCCCCTCT 
+ + + + + + 

6130 61S0 6170 

CTCCCTC ATTGGGGTC AGGGTGCCCGTGGC C AGC ACGTGCGCG CAAGG CCATGTGGACAG 
+ + + + + ■ + 

6190 6210 6230 

CACCCACACACCACACTGCACCCACACCACACCTGTGCCCGGGC C CACCCTACCTCTTCC 
+ + + + + + 

6250 6270 6290 
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CCAAACCCTTAGAGGCCTAGGAGCAGCAAAGCTTGGTTCTCTACTCTCAGTTAAGTGCTC 
+ + + + + + 



6310 6330 6350 

TCTGGGCTGAGAGACCTCCCCTCCTTCCCCTCCCCCACATCCACTCAGAGCCCTCCCTGC 
+ + + + + + 

6370 6390 6410 

ACTGGCCCCTCTAGCCTCCTTTCCAAGGTGGCAGACTCCTCTCGGCCCTCATCTGCCTGA 
+ + + + + + 

6430 6450 6470 

TGGCAATTCACTCATCCAATCAAGGAGGGCTTCTTGGAGGAAGGGTCTTTGATGTTTGTA 
+ + + + + + 

6490 6510 6530 

GTCTGGGAGAGAAGGTGGAGGAGAAAAAAGGAGTTGGGGTGGCCTAGCAGGAGCTGAGTC 
+ + + + + + 

6550 6570 6590 

ACTTCCACAGGCAGCCATCAGCCCAGCAGGACTGAGGCCAGGGCTGCGTGGAGGGGGGAG 
+ + + + + - + 

6610 6630 6650 

GCTGTCTGTTCTGGGAGCTGGGACTGGGTACCGGGGGAAGGAGGGCTGCTGCAGGCTCTG 
+ + + + + + 

6670 6690 6710 

GGTGCCTGGGGCCTGGCTCCTGCAGGGCGGGCCTGTGAGAGTGGTTGGGGCCAGTGGAGG 
+ + + + + + 

6730 6750 6770 

GGCTGGGAGCATTCCAGGGGAACATTCCAGGCGCCCTCTGAGTAATGCTTGGCTCTGGGA 
. + + + + + + 

6790 6810 6B30 

TTCCTCCTAGAGCCCCCTTAGGCACACCCGGCCAGGGAGCACCAAGGCTCCGTCCGGAAG 
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6850 6870 6890 

CGTCCCCTCCCCTTGAAGAGATGAGGAGGGGCCTTCTGGGCCAGGGTACCAAAACCCTGC 
+ + + + + + 

6910 6930 6950 

CACCAGGACAGAGTCCCCGAGGGAGCTCTGGGCAAGGTGGACCTCGCAAGGCAACATCTG 
+ . + + + + + 

6970 6990 7010 

GCTGTTGTTTTTCTCAGATGATGGGGGGGGCACAAGTGTCCTCTCTTCGTACATCTCTCA 
+ + + + + + 

7030 7050 7070 

CCCTAAAGGCATCTGCTGCCCATCTAAAAATCCCTAAGGCTGCCGCGCTCTTTCCTTCCC 
+ + + + j. + 

7090 7110 7130 

CTCTGCACTGGCX3GCCTTGGCCTCTTCCTTGTGATCGCCGAGCCCAAGCCTGCCCCCCGA 
+ + + _ + + + 

7150 7170 7190 

CAAAGGTCAGGGGACTCCCGTGTCCCCAGCTGAGCTGTCCCTTTCCAGCCTTCTCTTTTC 
+ + + + + + 

7210 7230 7250 

CTCCTCCTTGATAGCTCCTCAGATCCAAGGATGCCCACGGGCGTCCCTCCTTCTCCAGGC 
+ + + + + + 

7270 7290 7310 

TGAGCCCACGCGTGTTGAAGGTGAAGTCTGCCCCAAAAGGCCTCCAGTGCCTCCCTGGGG 
+ + + + + + 

7330 7350 7370 

ATGTCCTCTACCCCCCTCCCTCTGCTTTGTCCCATGCCCCTGTGTTCCTCAGGTCCCCCT 
+ + + + + + 

7390 7410 7430 
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CACCCTGTGCTCTGTCTTTACTCCAGGACATATCCCTGGGCAGCCAGTCACCCCGCACTG 
+ + + + + + 

7450 7470 7490 

GGTCCTGGATGGACAACCCTGGCGCACCGTCAGCCTGGAGGAGCCGGTCAGTGCCATGTC 
+ + + + 1., h 

7510 7530 7550 

TCCCCGCCCTCCACAGGGGCCCTGAACCTCCCAGCCCTTTTGTCTCTCCCTACATTACAG 
+ + + + + + 

7570 7590 7610 

CTTCTAGTTTTGCTGGGGTCCCCAGAACCACCAAGTCACTACTCCTATAGGCCCCTGCCT 
+ + + ~ + + + 

7630 7650 7670 

CC CCTGCCCCTCAAGTGGGC AGAAGAAGGCACTGGGGTTTGG AC ATCTGGATC TCGTGAG 
+ + + + + + 

7690 7710 7730 

CCCGCACACATGGAAGTCATTTCAGCTTTCTCCACCCCACCTCCCTCTTTCCCTCCCTCC 
+ + + + + + 

7750 7770 7790 

CTGGATGATCTGGGCCACCCCCACCCCCACCAGGCAGAAATGGGTCCAGAGTTTGTGGGT 
+ + + + + + 

7810 7830 7850 

CCTGAAGC TTTTC AGG AGCC TCTAAAAAAAAAAAAAAAAAAAAAAGC AC C AAAAAGAAAA 
+ + + + -- + + 

7870 7890 7910 

CCTTTTGC AAAGTTGAC CAGAAC ATGTGAC CCTGTGGACACACTGCTGTCC CTCTC AGGG 
+ + + + + + 

7930 7950 7970 

CCCTGCCACGAAGGCCTGAACCTTCAGCCTCACTGGCTCCTGTGGAATCCACTTCTGGTA 
+ + + + - + + 
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7990 8010 8030 

TGGGGGGGGCAGTGGTCACTCTCCTGATGTCCCCCAGATGTAAGACCACCCCATGTGCTT 
+ + + + + + 



8050 8070 8090 

CTTCTGCAGGACGCTCTGCCCCAGCCTCTTCCCAATCCCGCTCTTCACACGCTTCCAGAA 
+ + + + + + 

8110 8130 8150 

TAACCATGCCCCATCTGTTTGTGCCATAATATCTGTGCTGCAAACTAAGAGGGCAGTAGC 
+ + + + + + 

8170 8190 8210 

CTTGATATGCTCATTTTACAGAGGGGCAAACGGAAGCCCAGAGAGCTTGGGGAAATTGTC 
+ + - + + + + 

8230 8250 8270 

CATGGTCACACAGCTCTTTAGGCTGGGAGCCTGAGACCCACTAAGGTCTGAACGATTTTA 
+ + + + + + 

8290 8310 8330 

AACCATTGGCTACACCCCTGCCCCTCCTAGAGAGCCCTCTTTGTTTGGAATTTTCAGCCC 
+ + + + + + 

8350 8370 8390 

T ACTGTCC AAATCCAGC AAGAGGGAAGGCAGGGGAGCATTGC C ATGAAGG CTG AGAGGCC 
+ + + + + _; + 

8410 8430 8450 

CCCAGAGACCCAGCAGCTCCCAACCCAGGGCCCTCACTGGGATCCCCTAGGCCCATAAGG 
+ + + + + + 

8470 8490 8510 

CCCCCATTCCACTGGTCAAGCACGGCACTGGCCTGAGCTTTGAGATTGCCCTCCCCATCC 
+ + + + + + 

8530 8550 8570 
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CCAGGAGGGGAAGGCTGGACACACACTGGGGTCACTCTGCCTCTGGGCCTCCCTGTCTGT 
+ + -- + + -- + + 

8590 8610 8630 

CTGGCCTGGGCTGTGACCAAGAGGAGAGCCCCAAAGGGGCTCTGCTTCCCCCACCGGTGG 
+ + + + + + 

8650 8670 8690 

GCCCCTGCCCCCAGGAAGCCTGCCAAGATGGTACAGAAGAAAGAGTAGAGGCTAGGTATC 
+ + + + + + 

8710 8730 8750 

CCCTCCAAAAGGCAGGAAACACTCACATTTCAAGATGAGGGGTATATATCAAGGGGCAGG 
+ + + + + + 

8770 8790 8810 

GTACCAGGAGGGCAAGAGTAAAGATAGCAGGGGCTGCAGAGGAACAGGGACCTCGAGTAT 
+ + + + + + 

8830 8850 8870 

GGCCTTTTTCCCGGTGCAGACCTTTCCCCAATAAAGCAAGTGGCATTCCAGCCTCATGAG 
+ + + + + + 

8890 8910 8930 

CTCATGCTGGAGGCCTTGTGGGGCCTGTGGCCAGGGAGGCAAGGACCATCTGCTCCCCAC 
+ + + + + + 

8950 8970 8990 

TTGCGAAGGAAGAACTCCCTCCAAAGACTCTGAGACCCTTGGACAGGGCCCCAGGCCAGT 
+ + + + + + 

9010 9030 9050 

GCATTTTTGGAGAAAAGGAGTCGGGGGTTAAACATTCCGAAGGCGCAGCAGCCTCCCAGG 
+ + + . + + + 

9070 9090 9110 

AAGCTCCTGGGCCGGCTCCAACTCTGGGCCCCCAGCCAGGCTGAGTGGACAAGGGGGAAG 
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9130 9150 9170 

TGGGGTGTTCCCACAGGGTGGGAGACGCCAAGAGGGTGGGGGAAGGAGAGAGGGCTGGCC 
+ - _ + + + + + 



9190 9210 9230 

GTCCAAGCCAGCCTCCTGACACCTAGCTGAGAGCCAGTGTGCTCTCTTGGCTGGAATGGC 
+ + + + + + 

9250 9270 9290 

GTCCATGTTTACTTCGTGGGTCCAGTGAAGCAGGTGTCGGAGCCGGAGGGACGGGGGCTG 
+ + + + + + 

9310 9330 9350 

CTGGAGGCCCAGGAAAACTTTGGAAGAGGGAGCAGTTTGCCAAAATTGGAAGTGGAGGAG 
+ + + + + + 

9370 9390 9410 

TCAAATTTGAATTCTATAGGAAATGAGCAGCAGCTCATTTGGAACCAAGCCTCAGGTAGC 
+ + + + + + 

9430 9450 9470 

AGAGGCTCTGAGGAGGCCCTGACCATGGCTACCCGATGCCCCCATAATGTCCTCAGCACC 
+ + + + + + 

9490 9510 9530 

CCTCTGTCTTCCCCTGCTTTTGATGCCCCTTCTGGGCATGAAAGAAGAGGGCGGGGCCAG 
+ + + + - + + 

9550 9570 9590 

GGGAGGGGCACCTTTCTGGGACCTGTGGTCTCTAGGGAGGATGCTGGTGTGCCTGGCAGG 
+ + , + + + + 

9610 9630 9650 

CTGTGCCAACGCCCTTCCAAGTGGCTGTTGTCAGGACTGCAAACATCCTGAGTTTGGGAA 
_ . + + + + + + 

9670 9690 9710 
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CATCTTTGTATGTTCTCACCTCCTCCACGCCCTCCATAGTATGTGGGGGGTCCTGCTGAC 
+ + + + + + 



9730 9750 9770 

TCCCCAGCCCACGTTCTCCCCAAGAACTTCCTCCCCAGCCGGCTCCACAGGCCACCTACT 
+ + + + + + 



9790 9810 9830 

CCCTGGCAGGCAGGAGGCCTGGAGGCCACCATCTCAGCTCCACACTCTTTCTTGCCCAGG 
+ + + - + - + + 

9850 9870 9890. 

TCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAAGGCCAGGAGCTCCTGCTTG 
+ + + + + + 

9910 993 0 9950 

AGCTGGAGAAGAACCAGTGAGTGCCAGGCTGGGGTAGGGCTGGGAGGAGGGGATCAGTGT 
+ + -- + + + + 

9970 9990 10010 

TGGGGGGC AGGGACTGAC AC AG ATCTGTGCGGGTGGCTGGATGGGCAGAGGAC C C C AGAG 
+ + + + + + 

10030 10050 10070 

AGGGTGCAGATGACAGGGAGAGTCACGCAGGCCTGTGGTTGGCTCCCTGGAGGCTGAAGA 
+ + + + + + 

10090 10110 10130 

GGACCGCTGAGGCTGTCAGCCCCGCTGTGGGGCACCTCCGCCCTCCCAACCCCAGGAGCG 
+ + + + + + 

10150 10170 10190 

GCTTGTTAGCTCCCTGCTGGCGATGAGTGAGCACCACCTAGTGGACATTTGCAAGATATG 

10210 10230 10250 

CTGAGTCTAAAGAAATCCTAGAGGGAAAAGATGAGCCGGCACCCCAGGCTAAGGGAATGG 
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10270 10290 10310 

CAGGGACCAAGATGCGGTGGCTTTGGGAGGCCGAGGCGGGCGGCTCACCTGAGGTCAGGA 
+ + + + + + 



10330 10350 10370 

GTTTGAGACCAGCCTTGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATT 



10390 10410 10430 

AG CCAGG CG TGGTGG CGG CGCCTGTAATCCC AG CTACTTAGGGGGCTGAGACGGG AG AAT 



10450 10470 10490 

CGCTTGAACCCCGGAGGTAGAGGTTGTGGTGAGCCAAGATCACACCACTGCACCACTCCG 
+ + + + + + 



10510 10530 10550 

G C CTGGG CAAAGAGTGAG ACTCCGTCT CAAAAAAAGAGAAAAAAAAGAAAAAGAAAAAAA 



10570 10590 10610 

AAAAGAAAGAAAAGAAAAAAGAAAAAGATGCAGTGGCTACACTTGGGGGCAGCAGTTTGT 
+ + + + + + 



10630 10650 10670 

CTGACCTGCCTGGAAGGTCTCCATCTACAGGGAGGGGAGCAGGGGGGAATGAATTTGGAG 



10690 10710 10730 

AGTCCCAGGAGGGCCAGATCACAGAAGGCCATTTTGGTGCTCAGTGTCCTGGACCATCCA 
+ + + + + + 



10750 10770 10790 

GAGCCAAAGATTTTGAGCTGGGGAAGGGACAGGCAGACCTGTGCTCAGGAAGGTGCCTTG 
+ + + + + + 



10810 10830 10850 
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GGCTGGGTGGGGTGGGTGTCCGGGCTGGAGCGCAGGCTCTTAAAACCACCCAGATTATGT 



10870 10890 10910 

TATCAGTATATATCACCTACTGAGTGCTTGACCGCAGGCGCTGTTCTGAGGACTTGACAC 
+ + + + + + 



10930 10950 10970 

GTATTTTATTCTCCCTCGTGGAGTCGGATGGACAGGGAACAAACTCTAGTTCCACTGTGC 



10990 11010 11030 

CCAACCATATTTTCCCGACGTCCCTACCCTTTCAATGGGGTGGTCACATCACCTACCTCC 
+ + + _ + + + 



11050 11070 11090 

TAGGGTGGCGGGTGTGTGTGGGGCAGGGGTAGGGGGCAGAGCTGGGGCAGGTGGTGGAAT 



11110 11130 11150 

GCCTGGGAGGGGGGAAGCAGCCATCATTAGCGGGTGGTCTGGAGGTAATGAGGCCAAGGT 
_ + . + + + + + 



11170 11190 11210 

GAGGTTGGGTTAAGGATTTTCTTTAAAGAAGACAGATTGACTTATGATTGATCCATCCGT 



11230 11250 11270 

GTGGGAAAGATCCTGTTGAGATGGAGCCTGAAGATGGAATCATTACCGGAGTGGGTGTGG 



11290 11310 11330 



11350 11370 11390 

GGGGGTGAGCATCAATCACTGAAGGACAGGTGGGAGGTATGTGGGGTCTAGAAGTCTGAG 
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11410 11430 11450 

GAAAATATTTCAAGGATCTAGGGCAGGTGGGGGCAAGAGGGTCGACCAGATGCCCAACAA 
+ + " + + + + 

11470 11490 11510 

AGGAGGGCAGCAGGCAGGGGAACTGGGGGAGGTCACCGCATTTCCCCAACTCCAAGTCCC 
_ + + + + + + 

11530 11550 11570 

ATTCTTCGGCAGTGTCTCCTGACTCCTCCCCTCCCGATCCTGTGGATCCTGCTGCCTGCT 
+ + + + + + 

11590 11610 11630 

GCAGGTCCCCTGGGAACCACAAACTCTTCCCCTATTCCCACTCCTCCCCGGCGTCCTCCC 
+ + 4- + + + 

11650 11670 11690 



11710 11730 11750 

CCCCAAGAGTTTCTGATCTGAGCAAGTCACCATTGCTCCTGTCCCTTCCCTAAGACACAC 
+ + + + + + 

11770 11790 11610 

TGTGAGTG TCTCAC TCATAAAGCTGCTCCATTAGCATTTAGGGAGG AAGGCTGGGAGACA 
+ + + + + + 

11830 11850 11870 

TCCTGGAGGAGGCAGGAGGAAGCTGAATTC AGTGTTC C CTGTAACACCCCCTCTCAGC AG 
+ + + + + + 

11890 11910 11930 

GCTGCTGGCCCCAGGATACATAGAAACCCAGTACGGCCCAGATGGGCAGCCAGTGGTGCT 
+ + + + + + 

11950 11970 11990 
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GGCCCCCAACCACACGGTGAGATGCTTCCATGGGCTCTGGGATGCACCGCCAGAGGTACC 
+ + + + + + 

12010 12030 12050 

CCCCCACCATTCCTACCCCTACTCCTCCTTGCATTCCTAAGGGGCGGTTGGAGCCAGCCC 
- + + + + + + 

12070 12090 12110 

CTACCACACCCTCCCTCTTGCCCCTCTTGCTCCAGCCCTGGCTGAGATTTGGGGCTGGCC 
+ + + + + + 

12130 12150 12170 

CCTTCCTCCCTAGGATCATTGCCACTACCAAGGGCGAGTAAGGGGCTTCCCCGACTCCTG 
+ + + + + + 

12190 12210 12230 

GGTAGTCCTCTGCACCTGCTCTGGGATGAGGTGAGCTCTGGGAGAGGAGGCTGGGCCTGG 
+ + + + + + 

12250 12270 12290 

GATGGGGAAAGAGCTCCCTCACACCCGCTCCTACCCCTCTGCACCCTAGTGGCCTGATCA 
+ + + + + + 

12310 12330 12350 

CC CTCAGCAGGAATGCC AGCTATTATCTGCGTCCCTGGC CAC CC CGGGGC TC CAAGGACT 
+ + + + + + 

12370 12390 12410 

TCTCAACCCACGAGATCTTTCGGATGGAGCAGCTGCTCACCTGGAAAGGA^CCTGTGGCC 
+ + + + + + 

12430 12450 12470 

ACAGGGATCCTGGGAACAAAGCGGGCATGACCAGCCTTCCTGGTGGTCCCCAGAGCAGGG 
+ + + + + + 

12490 12510 12530 

TCAGGGGCATCGATCGGATGGGAGTGGGAATGCTGTATCTATAGCCCT C CAAAT CAGAAG 
+ + + + + + 
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12550 12570 12590 

AGACAGGAATTCACAGGCCTCGAGTCCCAGTATTTTTATTGAAGTCTGAAGAAACAAGTT 
+ + + + + + 



12610 12630 12650 

CCAGAAAACATGTTAAACTTCCTTCTGGGAGCTGGGGTTGGGGGTCAGGGCTCAAGCCCA 



12670 12690 12710 

GCAGCTTCCACTCAGGGTCCCCATTTGCACCTCCGCAGGGCAGGCGAGAAGCGCGCAGGA 
+ + + + _ + + 



12730 12750 12770 

CCCGGAAGTACCTGGAACTGTACATTGTGGCAGACCACACCCTGGTGAGGAGAGACCCCA 
+ + + + + + 



12790 12810 12830 

GGGGTTGGCGGGGTCAGGGATGGGGCCAGCTCAGCCCCTCAAGCCACCGGGATTTCTGCC 



12850 12870 12890 

TTCCCAGTTCTTGACTCGGCACCGAAACTTGAACCACACCAAACAGCGTCTCCTGGAAGT 



12910 12930 12950 

CGCCAACTACGTGGACCAGGTTGGGGGCGGCGGGGAGAGAGCGGTGATGGGGGTGGCGGC 
+ + + + + + 



12970 12990 13010 

GGC AG GACAGGCAGGTGCTGGTGGGGTTTGGGGAAG AGGAAGGG CGCCCC ACGAAGGACC 



13030 13050 13070 

ACCGGCGCGATGGGGCGCCCTGTCCCGGCTTCAGCCCCGCCTCGCCCTCAGCTTCTCAGG 



13090 13110 13130 
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ACTCTGGACATTCAGGTGGCGCTGACCGGCCTGGAGGTGTGGACCGAGCGGGACCGC^GC 



13150 13170 13190 

CGCGTCACGCAGGACGCCAACGCCACGCTCTGGGCCTTCCTGCAGTGGCGCCGGGGSCTG 
+ + + + + + 



13210 13230 13250 

TGGGCGCAGCGGCCCCACGACTCCGCGCAGCTGCTCACGTGGGTGCCTCTGACCCGGACG 



13270 13290 13310 

CGGGTCCCGGGTGGGGCGGCCTCACCTCCCGGCCCCGCCTGGTCACGCCGCGCTCCGCCC 



13330 13350 13370 

CCAGGGGCCGCGCCTTCCAGGGCGCCACAGTGGGCCTGGCGCCCGTCGAGGGCATGTGCC 



13390 13410 13430 

GCGCCGAGAGCTCGGGAGGCGTGAGCACGGTGAGCCCCGCGGGCGGGGGCGAGGGAGAGA 
+ + + + + + 



13450 13470 13490 

CAGGAGGCTCTACGGCCGCAGTGACCGCCCTCCCACGGCCCCCCAGGACCACTCGGAGCT 



13510 13530 13550 

CCCCATCGGCGCCGCAGCCACCATGGCCCATGAGATCGGCCACAGCCTCGGCCTCAGCCA 



13570 13590 13610 

CGACCCCGACGGCTGCTGCGTGGAGGCTGCGGCCGAGTCCGGAGGCTGCGTCATGGCTGC 



13630 13650 13670 

GGCCACCGGGTACGCGGGTGGGGGGTCGGGGCTGCGGCGGGGCGGCTAGTCCTGGGGACT 
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13690 - 13710 13730 

TCCTCCGCTGCGTTTCTTTGGTCGTCCCTCAGTTTCCTCTTCTGTAAAATGGGGATAATG 
+ + + + + + 

13750 13770 13790 

ATCATAGTGTCCGCTTCAGGGTGGTTTATGAGGCTTAT^AGGGAAGAAGCTCAGGCAAAGT 
+ + + + + + 

13810 13B30 13850 

GGATTCTCAACGGTATGAAGATTATTTTCCGAGTAACCTGGCGAGGTTACTCCTACACCG 
+ + + + + + 

13870 13890 13910 

GGAGGAGCACCGTCGGGTCGCGATTCCACCTTGGGTCCCGGGCTGCTCACTATTGGGGCC 
+ + + + + + 

13930 13950 13970 

GCATCGTCCCCTGTCCCGCTTGTTGTGTGACTTTGCGCGGGTTACTTCCCCTCTCTGGGC 
+ + + + + + 

13990 14010 14030 

TCTGCGCGTCTGGCGGCTGTAGCCAAGCCCAGGGGTGGGGATCAGAGAAGCGCGGGGGTT 
+ + + + + + 

14050 14070 14090 

GGGGGACTGTCCCTCCATGCCCAATGCCCTCCCCGTGCCGGTAGGCACCCGTTTCCGCGC 
+ + + + - + + 

14110 14130 14150 

GTGTTCAGCGCCTGCAGCCGCCGCCAGCTGCGCGCCTTCTTCCGCAAGGGGGGCGGCX3CT 
+ + + + + + 

14170 14190 14210 

TGCCTCTCCAATGCCCCGGACCCCGGACTCCCGGTGCCGCCGGCGCTCTGCGGGAACGGC 
+ + + + + + 

14230 14250 14270 
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TTCGTGGAAGCGGGCGSAGGAGTGTGACTGCGGCCCTGGCCAGGTTAAGTCGGCTCGCCCG 
+ + + + + + 

14290 14310 14330 

GCCCCCACTTGCCCTCTCCGCTCAGGTCTGGGGCGCTGCGCCCTCACCTGGGCCCTTCTT 
+ + + + ■ + + 

14350 14370 14390 

GCCTTTCTGGTCCCAGGAGTGCCGCGACCTCTGCTGCTTTGCTCACAACTGCTCGCTGCG 
+ + + + + + 

14410 14430 14450 

CCCGGGGGCCCAGTGCGCCCACGGGGACTGCTGCGTGCGCTGCCTGGTGAGGGCATGGAA 
+ + + + + + 

14470 14490 14510 

GGTTCAGGGTGAGGGTTTCGG.GGAGCTTGGGAGCCGGCCTGTTGGCCTTAGTTAATTGGT 
+ + + + + + 

14530 14550 14570 

GCCCTCAGGTTCCCCCGTTGGGTGCTGGGCTTGGGTAGGCCTGGCTCCCCCAGCTCCGAG 
+ + + + + + 

14590 14610 14630 

CCGCGCTCTCGGCATGGACCTCTCACTGCACGTGGCCTCTCTCTGCCTTCCCCACCACCC 
+ + + + + + 

14650 14670 14690 

GTCACCTGCGCAGCTGAAGCCGGCTGGAGCGCTGTGCCGCCAGGCCATGGGTGACTGTGA 
+ + + + + + 

14710 14730 14750 

CCTCCCTGAGTTTTGCACGGGCACCTCCTCCCACTGTCCCCCAGACGTTTACCTACTGGA 
+ + + + + + 

14770 14790 14B10 

CGGCTCACCCTGTGCCAGGGGCAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGA 
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14830 14850 14870 

GCAGCAGTGCCAGCAGCTCTGGGGGCCTGGTGAGAGGACACGAGCACCCTTGCACCCTGC 
+ + + + + + 

14890 14910 14930 

CCCCCATCCTCTGGTGGGGCCAGTTTTCTACTGTGGGGAAGATGGGCAGGGGAAACTGAG 
+ + + + + + 

14950 14970 14990 

GCCCGCTGAGCGCAGCCCCTCTCCGAGCTGCCCCCAGCCTGGCCCATGCTTCCTCAGGCT 
+ + + + + + 

15010 15030 15050 

CCCACCCAGCTCCCGAGGCCTGTTTCCAGGTGGTGAACTCTGCGGGAGATGCTCATGGAA 
, + + h + + + 

15070 15090 15110 

ACTGCGGCCAGGACAGCGAGGGCCACTTCCTGCCCTGTGCAGGGAGGTAGGGAGTGGAGC 
h + + + + + 

15130 15150 15170 

TGAGTGGAGGGAGCAGAAGCTATGGAGTGGGTTTGGGGAAGGGGGGTACTGCAGCTGTTG 
+ + + + + + 

15190 15210 15230 

ACCCCCCTCTACTTCCTCCCCAGGGATGCCCTGTGTGGGAAGCTGCAGTGCCAGGGTGGA 
+ + h + + + 

15250 15270 15290 

AAGCCCAGCCTGCTCGCACCGCACATGGTGCCAGTGGACTCTACCGTTCACCTAGATGGC 
+ + + + + + 

15310 15330 15350 

CAGGAAGTGACTTGTCGGGGAGGCTTGGCACTCCCCAGTGCCCAGCTGGACCTGCTTGGC 
+ + + + + + 

15370 15390 15410 
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CTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGAGCTCTGCCCACCCG 



15430 15450 15470 

ACCCCTCCTTGCCGTTTGAATCCCGCAGGCCAGTGTCCCCCTCACTGCCTGGTGCACTGC 



15490 15510 15530 

CCGTAGGTGTGC CAG AGCAGGCGCTGCAGGAAGAATGCCTTC C AGGAGCTTC AGCGC TGC 
+ + + + + + 



15550 15570 15590 

CTGACTGCCTGCCACAGCCACGGGGTGAGAGCCCGAGGAGTGGGGGTGACCTTGGGGTTC 
+ + + , + + + 



15610 15630 15650 

CTAATCCTACGTGACCCTCCTCTTCTCTTCTCTGCAGGTTTGCAATAGCAACCATAACTG 
+ + + + + + 



15670 15690 15710 

CCACTGTGCTCCAGGCTGGGCTCCACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCAT 
+ + + + + + 

15730 15750 15770 

GG AC AGTGGCCC TGTGC AGGCTG AAAGTATGC CAGTGGGGGGCATGTGGGC AGGAGCTGG 
+ + + ~ + + + 

15790 15810 15830 

GGTGGTGCACCTGCTCAGGACTCAGCGCCCCTTCCCCCAATCCCCGCAGACCATGACACC 
+ + + + + + 

15850 15870 15890 

TTCCTGCTGGCCATGCTCCTCAGCGTCCTGCTGCCTCTGCTCCCAGGGGCCGGCCTGGCC 
+ + + + + +' 

15910 15930 15950 

TGGTGTTGCTAC C GACTC CCAGGAGCCC ATCTGCAGCGATGCAGCTGGGG CTGCAGAAGG 
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15970 15990 16010 

GACCCTGCGTGCAGTGGGTAGGCTCCGAGCGCCTGCTTCCTGAGCCTACTCCTGCGGTTC 
+ + : + + + + 

16030 16050 16070 

CCCTCCTCAGAGCTCTGCTGGGGCTGTGGGAGCTGGGGCAGGCCCTCAGCCTTGCCCCCA 
+ + + + + + 

16090 16110 16130 

GGTGCAGAGAGCAGCCCCAGAGGCCATGGAAAGAAGTAGCTTTGAACAGGAGGTTCCAGT 
+ + + + -' + + 

16150 16170 16190 

GGCCTCCCAGTCAAGCGAGGGGGTGGATCCCTGCCCCACCACCAGCACCGCAAGGCATGG 
+ + + + + + 

16210 16230 16250 

CCCTCTACCTCCCAGTACAGCTCCTCTTGTCCACTCTCCTGCTTCTCCCACCAGCTGGCT 
+ + + + + + 

16270 16290 16310 

GCCTCACCCTTGACTTCGCCCTGTTTTTCCCTGGCTCAGATTGCAGTCCCTGTACCATGC 
. + + + + + + 

16330 16350 16370 

TGCCCCCGGAGGCCTGTCCAGCCTCTGTCTCACCAGTTTT£GGCCCTTTGCCACTTCCTC 
+ + + + + + 

16390 16410 16430 

TGCACAAATCACCTCTGTCACCCCCTTGAAGTTCCCAAATGCTGGGCCCAGCACATCTTT 
+ + + + + + 

16450 16470 16490 

TCACTCCATACCACTGGTCAGCTGCGGTGCTGGCTGCCCCTGTGCSAGGGCCCTGCCTTA 
+ + + + + + 

16510 16530 16550 
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16570 16590 16610 

CTGTCCTAGGACTCACCCTGCACCAGGGCCCTAGGCAGGGCAGGCACTCTGTGGCCATGT 



16630 16650 16670 

CTGACATAGC C TGGTCTTGGG AGTGCTCCGGG CAAGC C AAGGG AGATGGC ATGATTTGGG 
+ + + + + + 



16690 16710 16730 
+ + + + + + 

16750 16770 16790 

GCAGCTAAGAGGACCCTGACAAAGCGAGTTGTGATTGAGGGTCTGTGGGCAGAGGAGCAA 
h + + + + + 

16810 16830 16850 

GGTGGCCAGAGCCTGGCGTGTCAGCACGGAGGGGGCGCTGCAGAGGGTGGCGGCTGCTTC 
+ + + + + +. 

16870 16890 16910 

TCATCCCCAGGCGGGAGTCTCAGGGCAGGGGAGAATGTTTTGAAGGAACATCACAGGAAA 
+ + + + + + 

& 

16930 16950 16970 

TGACAAGGCCTTGGGSGATGGGATGGGGACAGTCAAAGATGGCTTGGAATCATCAAGGGC 
+ + + + + + 

16990 17010 17030 

AGCAGGGCACCCAGGGGCAAGGAGAGCAGACATAGCTGCCX3AAGGGGCGGACATCCAAGG 
+ + + + + + 

17050 17070 17090 

TTCTTTGGAAGCTGAGCGATGCCAGCATCTGGAGAGTGCCAGGCTGCTGGGTGGTGTCAG 
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17110 17130 17150 

AGCCTGGAGGAAATGTTAGGACTAGAGAGAGGAGGTGCCAGCCGAGGGCATGAGGCTCAC 
+ + h + h h 

17170 17190 17210 

TTGGAGCCTGGATCCCAAGGCTCCCCTGAAGAGGGAGCAGGAAGGGAGCTGAGAGGGTGA 
+ + + + + + 

17230 17250 17270 

CTTGGAGCAGATGGGTGCCCCAAGAAACTCAGTAAACGCAGAACTCCCTGGGCTGGACAC 
+ + + + + + 

17290 17310 17330 

CATGCTGCGGGGAGGCAATAACCCACTCAGGATCACTGTGCCAACCTCCTGGACTCTTAT 
+ + + + + + 

17350 17370 17390 , 

CACGTTGCTCAGCCCCAAAGATGGCCCACACAGGGACCACCCCCTGGGCGGCGTTCACCC 
+ + + + - + + 

17410 17430 17450 

CATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCTGGGTGAGTGAGGCACCAG 
+ + + + + + 

17470 17490 17510 

GGGGAGGTGGAGAGGGAAGGGAGAAGGGAAGGGCTCATGCCTCCTGCCTCCTTCCAGATG 
+ + + + + + 

17530 17550 17570 

GGCAGCACCCAGTCACCTTGAGTCCCCTATGCCCCTCCCCAGCCCCAGGGTCTCCTGCTG 
+ + + + + + 

17590 17610 17630 

• ACCATATTCACAACATTTACCCTCCACCATTTCTCCCAGACCCTGAGAACTCTCATGAGC 
+ + + + + + 

17650 17670 17690 
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CCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTCGCCTGACCCCCAAGGTAGGCAGG 
+ + + + + - + 



17710 17730 17750 

GACCTGGATTCAAAGCCTCCCCCTCTCATCGCCCACCCTCCCACCTCTCCCACCCCTCAG 
+ +- - + + + + 



17770 17790 17810 

TTTGCTGCCCCCTAATCAGGTTTCTGGGCTCAGGTTATTATGGAAATGAGTTTATGACCT 
+ + : + + + + 

17830 17850 17870 

CTTGGTTATCATGGAGACCAGGATGCTGGAAGCCCCTGGGCTGGGGAGGGAGAAGCTGTG 
+ + + + + + 

17890 17910 17930 

GCTTTTCCTGGATCACTGGTCCTCACTGAGTGAGGATGGGCTCXCTGCCACACAGCTTGC 
+ + + + + + 

17950 17970 17990 

AG CCTGGGGCCCCAGTCC.TTAGGGGAC AACATATCCTC CTCATTCTCAGCAGATC AAGTC 
+ + + + + + 

18010 18030 18050 

CAGATGCCAAGATCCTGCCTCTGGTGAGAGGTAGCTCCTAAAATGAACAGATTTAAAGAC 
+ + + + + + 

18070 18090 18110 

AGGTGGCCACTGACAGCCACTCCAGGAACTTGAACTGCAGGGGCAGAGCCAGTGAATCAC 
+ + + + + + 

18130 18150 18170 

CGGACCTCCAGCACCTGCAGGCAGCTTGGAAGTTTCTTCCCCGAGTGGAGCTTCGACCCA 
+ + + + + + 

18190 18210 18230 

CCCACTCCAGGAACCCAGAGCCACATTAGAAGTTCCTGAGGGCTGGAGAACACTGCTGGG 
+ + + + + + 
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18250 18270 18290 

CACACTCTCCAGCTCAATAAACCATCAGTCCCAGAAGCAAAGGTCACACAGCCCCTGACC 
+ + + + + + 



18310 18330 18350 

TCCCTCACCAGTGGAGGCTGGGTAGTGCTGGCCATCCCAAAAGGGCTCTGTCCTGGGAGT 
+ + + + + + 



18370 18390 18410 

CTGGTGTGTCTCCTACATGCAATTTCCACGGACCCAGCTCTGTGGAGGGCATGACTGCTG 
+ + + + + + 



18430 18450 18470 

GC CAGAAGCTAGTGGTC CTGGGGCCCTATGGTTCGACTGAGT C C ACACTCC C G TGCAGCC 
+ + + + + + 

18490 18510 18530 

TGGCTGGCCTCTGCAAACAAACATAATTTTGGGGACCTTCCTTCCTGTTTCTTCCCACCC 
+ + + + + + 

18550 18570 18590 

TGTCTTCTCCCCTAGGTGGTTCCTGAGCCCCCACCCCCAATCCCAGTGCTACACCTGAGG 
+ _ + + + + + 

18610 18630 18650 

TTCTGGAGCTCAGAATCTGACAGCCTCTCCCCCATTCTGTGTGTGTCGGGGGGACAGAGG 
+ + + + + + 

18670 18690 18710 

GAACCATTTAAGAAAAGATACCAAAGTAGAAGTCAAAAGAAAGACATGTTGGCTATAGGC 
+ + + + + + 

18730 18750 18770 

GTGGTGGCTCATGCCTATAATCCCAGCACTTTGGGAAGCCGGGGTAGGAGGATCACCAGA 
+ + + + + + 

18790 18810 18830 
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GGCCAGCAGGTCCACACCAGCCTGGGCAACACAGCAAGACACCGCATCTACAGAAAAATT 



18850 18870 18890 

TTAAAATTAGCTGGGCGTGGTGGTGTGTACCTGTAGGCCTAGCTGCTCAGGAGGCTGAAG 
+ + + + + + 



18910 18930 18950 

CAGGAGGATCACTTGAGCCTGAGTTCAACACTGCAGTGAGCTATGGTGGCACCACTGCAC 
+ + + + + + 

18970 18990 19010 

TCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAATAAATTTTAAAAAGACATATTA 
+ + + + + + 

19030 19050 19070 

ACTTGGACCTTGGTTAGTCTTTTCTGTATGTAAATTC AACCCATGGGGTGC C CTGAGGAC 
+ + + + + + 

19090 19110 19130 

CCACACGGGGTGGTGGTTGGCGGGGTGGTGGTTGGTGGGGTGGTGGCTGACGGGGTGGTG 
+ + + + + + 

19150 19170 19190 

GCTGGCAGGCCGAGCCTAGATGGCAGCCAGAGCCCCAGGCATGTGTCTGGGCACAGGACG 
+ + + + + + 

19210 19230 19250 

GTGTTGCCTAGTTTGAACACCCTCTTTGCTCTGTCACTCCTGGCTCCCTTGGGCGTTCAC 
+ + + + + + 

19270 19290 19310 

ATTCTCCCATTGCTTCATGCAAGAGCTGCTGAGTGGCCTATATCAGCCAGCTGTTGCCGC 
+ + + + + + 

19330 19350 19370 

ATAACAAAACCATCCCAAAACTGAGTGCAGGGAGGCAACTTCACCTCGGGCTCCACTCCA 
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+ + + + + + 

19390 19410 19430 

CAAGCCCAAGGGGCCAGGTGAGAGTGCTCTCTAAAGCCCCCTCCTGCCTCAGTTGTAGTT 
+ + + + + + 

19450 19470 19490 

GCAAAATTTTAATTTATGAAGGTGACTGATGACACAGAGGCCAATGCTGTTGAAATAAGT 
+ + + + + + 

19510 19530 19550 

TATTACTCACAGTTTCCCACCATGCAGGGCCACAGTGGGGAGGCACTAGGTTTGGTCCAG 



19570 19590 19610 

GGACAGAATCAGGAGCGAGTGGAAGGCACAGGCCACAGCCCACAGTGCCGTTTCCACTGG 
+ + + + + + 

19630 19650 19670 

GGAGGCAAGGCAGGCCAGGGGAAGAGGGTAGGATTGGCATTTTGAATCATTCTGGTGGGG 



19690 19710 19730 

TTTGGGGCGTGGGGTTGGGCTCTAATTGTCTGGGTAGGTGCCTGGCCCTGAGCTGGTTTA 
+ + + + + + 

19750 19770 19790 

GGGCAGGGGAAATACTGGTTTCGTATGTGAGAGTTCCTTGAAGGGGGTGGTTGGTGTATG 



19810 19830 19850 

GACTCAAGACTGGTCGGTTTGCATATGAAAGGCATGAGTTGTTTCTGATCTCCAGGAATC 
+ + + + + + 

19870 19890 19910 

AAGCAGTTTCTCTCCAGCCAACAAG CCCCCACC C CGAGATGTTAAACCATCATAAAATAG 
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19930 19950 19970 

AGAATCTAAGGCCAGGCATGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAG 
+ + + + + + 

19990 20010 20030 

G CGGGAGGATC ATTTGAGGTCAG AAGTTCGAG AC C AGCC TGG CCAATGTGGTGAAAC C C C 
+ + + + +- + 

20050 20070 20090 

ATCTCTACTAAAAATACAAAAATTAGCCCGGTGTGGTGGCACGTGCCTATAATCCCAGCT 
~ ~ " + + + + + + 

20110 20130 20150 

ACTCGGGAGG CTGCGG CAGGAG AATTG TTTG AAC ATGGGAGGTGGAG GTTG C AG TGAG CT 
+ + + + + + 

20170 20190 20210 

GAGATCGTGCCACTGCACTCCAGCCTGGGCAACAAGAGCAAGACTCCGTCTCAAAAAAAA 
+ + ~ + + + + 

20230 20250 20270 

AAAAAAAAAAAAGAG AGACTCTAAAAATACACGTTAATATAC CTCCCCCGCTCTTAC CCT 
+ -. + + + + . + 

20290 20310 20330 

TCAGGAGGGGGTGTCTAGACCCCGCGGGACTCCAGCTACAAGGGACCCTGGGGAGGCCAA 
+ ; + + + + + 

20350 20370 20390 

CTCTGCCCTCTTGGCTAATCCCCAAGACTGCCCAGCACCCCCTCCACCCCTTCTCCATTC 
+ + + + + + 

20410 20430 20450 

AGTGGCGAACCCTGGGGAGGCCACGTGGGAAGGAAAGAGGGCTCTAAGAGGGGAGGCCCC 
+ + + ■ + + + 

20470 20490 20510 

AGACTGGGGGAGAGGCCTGTCTGGAGCCCAGGATCACCTGGCTGTGCTGCAGAACTGGAG 
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+ + + + + + 

20530 20550 20570 

AAGAGAAGCTCAGCAGAAAGGAGCTGGCATGGGGCCAACAGCAGAAAAGCAGGAGGCACG 
+ + + + + + 

20590 20610 20630 

CAGAAGTGACTGGGAAGCAGGAGGGTAGGCATGGACCCTGAGGCTGAGCAGGAGGTACTG 
+ + + + + + 

20650 20670 20690 

AGGGGCAGAGTGGACGCTGAGCTGGGGGTAGCGAGCGAGCCCAGCTCAGCTGTGACGCCC 
+ + + + - + + 

20710 20730 20750 

TCTGTTTGGCCACCCAACTACCAGCTACTTGGGCTGCCCCGGGAGGAACTGGGCTTCCTC 



20770 20790 20810 

TGACATTCTGTGGCCTGCGGCCATCTGTCACACCTTCTTCTCTCTCTGCCCCTCCCTTGA 
+ + + + + + 

20830 20850 20870 

CTTGTGGCACCCACAGACAGGTGGGAGAGTGTACCTGCCCTGTGTGGTCAGAGCTTGGTT 
+ + : + + + + 

20890 20910 20930 

TTGAGTTTCCTTCCCTCACCCCTCTTTCCTCCCAGACGCCAAAAGACAAGAGGATGTGTC 
+ + + + + + 

20950 20970 20990 

AGAGGCCTGTGAACCAGAGCAACTCCATCCTGAATAGGGGCTGAGCAAAATAAGGCTGAG 



21010 21030 21050 

ACCTACTGGGCTGCGTTTCCAGACAGTTACAGCATTCTGCGTCACAGGATGAGATAGGAG 
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21070 21090 21110 

ATACAGGTCATAAAGACCTTGCTGATAAAATAGTTTGCAGTAGGCCAGGCGCGGTAGCTC 
+ + + + + + 



21130 21150 21170 

ACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACCTGAGGTCAGAAGT 
+ + + + + + 

21190 21210 21230 

TCGAGACCAGCCTGGCCAACAAGGTGAAACCTCATCTCTACTAAAAATACAAAAACTAGC 
+ + + + + + 

21250 21270 21290 

CAGG C ATGGTGGTGTGTG CCTGTAATC C C AGCTACTTGGG AGGCTGAGGC AGGAGAATCG 
+ + + + + + 

21310 21330 21350 

CTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCTGAGATCATGTCACTGCACTCCAGCCTG 
+ + + . + + + 

21370 21390 21410 

GAGCGAGACTCCGTCTCAAAAACCAAACCAACCAAAAAAATCAGCTTGCAATATAGAAGC 
+ + + + + + 

21430 21450 21470 

TGGCTAAAACCCACCCAAACCAAGATGGTGATGAGAGTGACCTCTGGTCGGTCCCCACTG 
+ + + + + + 

21490 21510 21530 

CTAC ACTC C C ACC AGC ACCATGAC AGGTT AC AG ATGC C ATGGC AGTATCAGGAAGTTACC 
+ + + + + + 

21550 21570 21590 

ATATATGGT CTAAAAAGGGGAGACATGAAC AATC CACCCCTGTTTAGCAGATCATCCAGA 
+ + + + + + 

21610 21630 21650 

AACAAC C ATAAAAATGGGC AACC AGCAGCCCTC AGGGCTG CGCTCTCTATGGAGTAGCCA 
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21670 21690 21710 

TTCTTTTATTCTTTTACTTTCTTAATAAATGTGCTTTCACTTTATGGACTCGTCTCAAAT 



21730 21750 21770 

TCTTTCTTGCACGAGATCCAAGAACCCTCTCCTGGGGTCTGAATCTGGACCCCTTTCCGG 



21790 21810 21B30 

• TAACAGATGTCGTAGAGTGAAGCACAACCACTGCAGGGGCATCTTGGTTTACATTTTGCT 



21850 21870 21890 

TCAGCGGCCATGGTTAGCACAGCGGAAAGCACATCACAGTCTTCTGATTCATTAAAAAAA 



21910 21930 21950 

TTAGGAAATGGACCACCACAAACCACAGACAGATGTACTGAGACAGGATAGGTAGTCAAG 



21970 21990 22010 

AAAGTGACCATGTTGTAGGCGCGCAGCAGCAACTGTGGTGACCGTACAGTCAACAAGCCT 
+ + + + + + 



22030 22050 22070 

CAGCACTGGCATTGCAATTGAGCTCATTCAAGCAAAGCTATCTTCAGCAGGGACTTCTCC 



22090 22110 22130 

CTCTAGGCAGCAAGCGCATTTTTATTTTACCTGTCCTCAAACTGATCCTTTGCTCCTTAT 



22150 22170 22190 

AAC AGT AAGGAAC ACAC C C CTGTGTGGAG ATTT AAGATGCTAATGAGGCC AAGCG CAGTT 
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22210 22230 22250 

GCTCACGTCTGTAATTCCAGCACTTTGGGAGGCAGAGGTGGGCGGGTCACTTGAGGTTAG 

- - + + + ; + + + 

22270 22290 22310 

AAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTCTACTAAAAATACAAAAAT 
+ + + + + + 

22330 22350 22370 

TAGCCGGGCATGGTGGCGGGCGTCTGTAATCCCAGCTACCTGGGAGGCTGAGGCAGAAGA 
+ + + + + + 

22390 22410 22430 

ATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAG 
+ + + + + + 

22450 22470 22490 



+ , + + + + + 

22510 22530 22550 

TTTTAAAAAAAAGTCAAGACAAATCATAGTGGGGGCTTTTCTGGTCACTTTTTAAATCTT 
+ + + + • + + 

22570 22590 22610 

AGTGTTGAGACTTTATTTGAGACAGGGCCTTACTCTGTTGCCCAGGTTGGATGAGATTTT 
+ + + + + + 

22630 22650 22670 

TAACCTCAATATTTACTTATAGAATAACTTTTTGGTTAGTCAAAACAATGCTGTGTCTCA 
+ + + • h + + 

22690 22710 22730 

TTCTGATCAGAATAAAACATCAGACAACTCAAGAGAAACATTCTGCAAAATAACTGGCCA 
+ h + 1- - + + 

22750 22770 22790 

GGATTCTTCAAAAGTGTCAAGGGTAAAGATAAGGAAAGATGAAGGAACTCCCAGATTGAG 
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-+ 



22810 22830 22850 

GAGAATAAGGAGACAACTGTGATGTGGGATCCTAGAATGGATCTTGGAACAGAAAAAGGA 



22870 22890 22910 

CATTAGTGGAAAAATGAGAAATGCAAAACAGTCTACAGTTTCGTTAACAGGATTGTACCA 



22930 22950 22970 

AGGTTAGTTTCCTAGCTGTAATGATTGGACTATGATTAAGTAAGATGGACCATCAGGGGA 



22990 23010 23030 

AGCTGGGTGAAGGGTGTAAGGAAAATGCTTACATTTTCCAACTTTTCTGCAAGTCTAAAA 



23050 23070 23090 

TTAGTCAACAATAAGAAGTTTAAAATAGGCCAGGCATGGTGGCTCACACCTGTCATCCTA 



23110 23130 23150 

GCACTTTGAGAGGCCGAGGTGGGAGGATGGTTTGAGCCCAGGAGTTCAAGACCAGCCTGC 



23170 23190 23210 

GCAATAGAGCGAGACCCCAACTCTATTCAAAAAAATTTTTTTAAGTTTAAAATAGAATTA 



23230 23250 23270 

TATAAAAAAGGAAAAGAAAAATGCTGTTTCATAGCGTTCCTAGTTTAGCATGGGAGAGAC 



23290 23310 23330 

CAGGTCTCCCTGGGTGGTTGTCTGTGTGTTGCTGGGTGTGCGTGGCAGGGCTAGTGTGTT 
+ + + . + + + 
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23350 23370 23390 

GGGGTCCGTCTAGGCACATTCAGGCGCCGAATCCCGTGGCTCCCAGGTTTACCTGACGGT 



23410 23430 23450 

GCAGCCTGGGGTGGAGACTTAATGAGGGCGGGGAGTTGCTGCAGCAAAGGCTCCTCCCAG 



23470 23490 23510 

GGGTATCAGCGCAGACAGCTGGGTTTTCACTGTGCTCCTGCTCCAGAGGCACTAGGAAGG 



23530 23550 23570 

GGGCGCCTATCAGACTAGGACTCTGCCAGCCATCCTTCTCTGTTGAAGGTCCAGC « 
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10 30 50 

CAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCCCGTTGCTGCTGCTGCTACTAC 
+ + + + + + 

MetGlyTrpArgProArgArgAlcArgGlyThrProLeuLeuLeuLeubeulieuL 

70 90 110 

TGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGACATATCCCTGGGCAGC 

+ + + + ~ + + 

euLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGlyHisIleProGlyGlnP 

130 150 170 

CAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACCGTCAGCCTGGAGGAGC 

+ + + + + + 

roValThrProHisTrpValLeuAspGlyGlnProTrpArgThrValSerLeuGluGluP 

190 210 230 

CGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAAGGCGAGGAGCTCCTGC 

+ + + + --+ 

roVal S erLy s P roAspMe tGlyLeuVa 1A1 aLeuG luAl aGluGlyGl nGl uLeuLeuL 

250 270 290 

TTGAGCTGGAGAAGAACCACAGGCTGCTGGCCCCAGGATACATAGAAACCCACTACGGCC 
+ + + + + + 

euGluLeuGluLysAsnHisArgLeuLeuAlaProGlyTyrlleGluThrHisTYrGlyP 

310 330 350 

CAGATGGGCAGCCAGTGGTGCTGGCCCCCAACCACACGGATCATTGCCACTACCAAGGGC 

. + + + + + + 

roAspGlyGlnProValValLeuAlaProAsnHisThrAspHisCysHisTyrGlnGlyA 

370 390 410 

GAGTAAGGGGCTTCCCCGACTCCTGGGTAGTCCTCTGCACCTGCTCTGGGATGAGTGGCC 

+ + + + + + 

rgValArgGlyPheProAspSerTrpValValLeuCysThrCysSerGlyMetSerGlyL 

430 450 470 

TGATCACCCTCAGCAGGAATGCCAGCTATTATCTGCGTCCCTGGCCACCCCGGGGCTCCA 

+ + + + + + 

euIleThrLeuSerArgAsnAlaSerTyrTyrLeuArgProTrpProProArgGlySerL 
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490 510 530 

AGGACTTCTCAACCCACGAGATCTTTCGGATGGAGCAGCTGCTCACCTGGAAAGGAACCT 

+ + + + + + 

ysAspPheSerThrHisGluilePheArgMetGluGlnlieuLeuThrTrpLysGlyThrC 

550 570 590 

GTGGCCACAGGGATCCTGGGAACAAAGCGGGCATGACCAGCCTTCCTGGTGGTCCCCAGA 

"~ + + + + + + 

YSGlyHisArgAspProGlyAsnLysAlaGlyMetThrSerljeuProGlyGlyProGlnS 

610 630 650 

GCAGGGGCAGGCGAGAAGCGCGCAGGACCCGGAAGTACCTGGAACTGTACATTGTGGCAG 
+ + + + + + 

erArgGlyArgArgGluAlaArgArgThrArgLysTyrljeuGluLeuTyrlleValAlaA 

670 690 710 

ACCACACCCTGTTCTTGACTCGGCACCGAAACTTGAACCACACCAAACAGCGTCTCCTGG 
+ + + + + + 

spHisThrLeuPheLeuThrArgHisArgAsnLeuAsnHisThrLysGlnArgbeulieuG 

730 750 770 

AAGTCGCCAACTACGTGGACCAGCTTCTCAGGACTCTGGACATTCAGGTGGCGCTGACCG 

+ + + + + + 

luVal Al aAsnTyrVa 1 AspGlnLeuLeuArgThrLeuAsp 1 1 eGl nVa 1 Al aLeuThrG 

790 S10 830 

GCCTGGAGGTGTGGACCGAGCGGGACCGCAGCCGCGTCACGCAGGACGCCAACGCCACGC 

+ + + + + + 

lyLeuGluValTrpThrGluArgAspArgSerArgValThrGlnAspAlaAsnAlaThrL 

850 B70 890 

TCTGGGCCTTCCTGCAGTGGCGCCGGGGgCTGTGGGCGCAGCGGCCCCACGACTCCGCGC 
+ + + + + + 

euTrpAlaPheLeuGlnTrpArgArgGlyLeuTrpAlaGlnArgProHisAspSerAlaG 

910 930 950 

AGCTGCTCACGGGCCGCGCCTTCCAGGGCGCCACAGTGGGCCTGGCGCCCGTCGAGGGCA 

1- + + + H + 

InLeuLeuThrGlyArgAlaPheGlnGlyAlaThrValGlyLeuAlaProValGluGlyM 
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970 990 1010 

TGTGCCGCGCCGAGAGCTCGGGAGGCGTGAGCACGGACCACTCGGAGCTCCCCATCGGCG 

+ + + + + + 

etCysArgAlaGluSerSerGlyGlyValSerThrAspHisSerGluLeuProIleGlyA 

1030 1050 1070 

CCGCAGCCACCATGGCCCATGAGATCGGCCACAGCCTCGGCCTCAGCCACGACCCCGACG 

+ + _ + + + 

laAlaAlaThrMetAlaHisGluIleGlyHisSerLeuGlyLeuSerHisAspProAspG 

1090 1110 1130 

GCTGCTGCGTGGAGGCTGCGGCCGAGTCCGGAGGCTGCGTCATGGCTGCGGCCACCGGGC 

[. + + h + h 

lyCysCysValGluAlaAlaAlaGluSerGlyGlyCysValMetAlaAlaAlaThrGlyH 

1150 1170 1190 

ACCCGTTTCCGCGCGTGTTCAGCGCCTGCAGCCGCCGCCAGCTGCGCGCCTTCTTCCGCA 

, + + + + + 

i sProPheProArgValPheSerAl aCys Se rArgArgGlnLeuArgAlaPhePheArgL 

1210 1230 1250 

AGGGGGGCGGCGCTTGCCTCTCCAATGCCCCGGACCCCGGAGTCCCGGTGCCGCCGGCGC 

+ + + + + + 

ysGlyGlyGlyAlaCysLeuSerAsnAlaProAspProGlyLeuProValProProAlaL 

1270 1290 1310 

TCTGCGGGAACGGCTTCGTGGAAGCGGGCGAGGAGTGTGACTGCGGCCCTGGCCAGGAGT 

+ + + + + + 

euCysGlyAsnGlyPheValGluAlaGlyGluGluCysAspCysGlyProGlyGlnGluC 

1330 1350 1370 

GCCGCGACCTCTGCTGCTTTGCTCACAACTGCTGGCTGCGCCCGGGGGCCCAGTGCGCCC 

+ + + + + + 

ysArgAspLeuCysCysPheAlaHisAsnCysSerLeuArgProGlyAlaGlnCysAlaH 

1390 1410 1430 

ACGGGGACTGCTGCGTGCGCTGCCTGCTGAAGCCGGCTGGAGCGCTGTGCCGCCAGGCCA 

+ + + + + + 

isGlyAspCysCysValArgCysLeuLeuLysProAlaGlyAlaLeuCysArgGlnAlaM 

1450 1470 1490 
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TGGGTGACTGTGACCTCCCTGAGTTTTGCACGGGCACCTCCTCCCACTGTCCCCCAGACG 

+ + + + + + 

etGlyAspCysAspLeuProGluPheCysThrGlyThrSerSerHisCysProProAspV 

1510 1530 1550 

TTTACCTACTGGACGGCTCACCCTGTGCCAGGGGCAGTGGCTACTGCTGGGATGGCGCAT 

+ + + + + + 

alTyrLeubeuAspGlySerProCysAlaArgGlySerGlyTyrCysTrpAspGlyAlaC 

1570 1590 1610 • 

GTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTGGGGGCCTGGCTCCCACCCAGCTCCCG 

+ + + + + + 

ysProThrLeuGluGlnGlnCysGlnGlnLeuTrpGlyProGlySerHisProAlaProG 

1630 1650 1670 

AGGCCTGTTTCCAGGTGGTGAACTCTGCGGGAGATGCTCATGGAAACTGCGGCCAGGACA 

+ + + + + h 

luAlaCysPheGlnValValAsnSerAlaGlyAspAlaHisGlyAsnCysGlyGlnAspS 

1690 1710 1730 

GCGAGGGCCACTTCCTGCCCTGTGCAGGGAGGGATGCCCTGTGTGGGAAGCTGCAGTGCC 

+ + + + + + 

erGluGlyHisPheLeuProCysAlaGlyArgAspAlaLeuCysGlyI>ysLeuGlnCysG 

1750 1770 1790 

AGGGTGGAAAGCCCAGCCTGCTCGCACCGCACATGGTGCCAGTGGACTCTACCGTTCACC 

+ + + + + + 

InGlyGlyLysProSerLeuLeuAlaProHisMetValProValAspSerThrValHisL 

1810 1830 1850 

TAGATGGCCAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCTGGACC 

+ + + + + K 

eiiAspGlyGlnGluValThrCysArgGlyAlaljeuAlaljeuProSerAlaGlnLeuAspli 

1870 1890 1910 

TGCTTGG CC TGG G CCTGGTAGAGCCAGGC ACC CAGTGTGGACCTAGAATGGTGTGCC AGA 

+ + + + + h 

euLeuGlyLeuGlyLeuValGluProGlyThrGlnCysGlyProArgMetValCysGlnS 

1930 1950 1970 
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GCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTGCCACA 

+ + + + + + 

erArgArgCysArgLysAsnAlaPheGlnGluIjeuGlnArgCYsIieuThrAlaCysHisS 

1990 2010 2030 

GCCACGGGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGCTCCACCCT 

+ + + + + + 

erHisGlyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaProProP 

2050 2070 2090 

TCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGAAAACC 

+ + + + + + 

heCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGluAsnH 

2110 2130 2150 

ATGACACCTTCCTGCTGGCCATGCTCCTCAGCGTCCTGCTGCCTCTGCTCCCAGGaGCCG 

+ + + + + + 

isAspThrPhelieuLeuAlaMetLeuLeuSerValLeuLeuProLeuIieuProGlyAlaG 

2170 2190 2210 

GCCTGGCCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTGGGGCT 

+ + + + + + 

lyLeuAlaTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTrpGlyC 

2230 2250 2270 

GCAGAAGGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCACCCCCTGG 

h + + + + h 

ysArgArgAspProAlaCysSerGlyProLysAspGlyProHisArgAspHisProLeuG 

2290 2310 2330 

GCGGCGTTCACCCCATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCTGGACC 

+ + + + + . + 

lyGlyValHisProMefeGluLeuGlyProTiirAlaThrGlyGlnProTrpProLeuAspP 

2350 2370 K 2390 

CTGAGAACTCTCATGAGCCCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTCGCCTG 

+ + + + + + 

roGluAsnSerHisGluProSerSerHisProGluLysProLeuProAlaValSerProA 

2410 2430 2450 

ACCCCCAAGATCAAGTCCAGATGCCAAGATCCTGCCTCTGGTGAGAGGTAGCTCCTAAAA 
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+ . + + - + + + 

spProGlnAspGlnValGlnMetProArgSerCysLeuTrpEnd 



2470 2490 2510 

TGAACAGATTTAAAGACAGGTGGCCACTGACAGCCACTCCAGGAACTTGAACTGCAGGGG 



2530 2550 2570 

C AGAG CCAGTG AATCAC CGGAC CTCC AGC AC CTGCAGGCAG CTTGGAAGTTTCTT C CCCG 
+ + + + + + 



2590 2610 2630 

AGTGGAG CTTCG AC C CACC C ACTCC AGG AAC CC AG AGCCAC ATTAGAAGTTC CTG AGGGC 
+ + + + , + + 

2650 2670 2690 

TGGAGAACACTGCTGGGCACACTCTCCAGCTCAATAAACCATCAGTCCCAGAAGCAAAGG 
+ + + + + + 

2710 2730 2750 

TCACACAGCCCCTGACCTCCCTCACCAGTGGAGGCTGGGTAGTGCTGGCCATCCCAAAAG 
, + + + + + + 

2770 2790 2810 
+ _ + . + + - + + 

2830 2850 2870 

GGAGGGCATGACTGCTGGCCAGAAGCTAGTGGTCCTGGGGCCCTATGGTTCGACTGAGTC 
+ + + + + + 



2890 2910 2930 

CACACTCCCCTGCAGCCTGGCTGGCCTCTGCAAACAAACATAATTTTGGGGACCTTCCTT 
+ + + + + + 



2950 2970 2990 

CCTGTTTCTTCCCACCCTGTCTTCTCCCCTAGGTGGTTCCTGAGCCCCCACCCCCAATCC 
+ + + w + + + 



FIG. 30 



WO 01/78894 



PCT/US01/12245 



142/157 



3010 3030 3050 

CAGTGCTACACCTGAGGTTCTGGAGCTCAGAATCTGACAGCCTCTCCCCCATTCTGTGTG 



3070 3090 3110 

TGTCGGGGGGACAGAGGGAACCATTTAAGAAAAGATACCAAAGTAGAAGTCAAAAGAAAG 



3130 3150 3170 

ACATGTTGGCTATAGGCGTGGTGGCTCATGCCTATAATCCCAGCACTTTGGGAAGCCGGG 
+ + + + + + 



3190 3210 3230 

GTAGGAGGATCAC CAG AGGCCAGC AGGTCCACACCAGCCTGGG CAACACAGCAAGACACC 
+ + + + . + + 

3250 3270 3290 

GCATCTACAGAAAAATTTTAAAATTAGCTGGGCGTGGTGGTGTGTACCTGTAGGCCTAGC 
+ + + + + + 

3310 3330 3350 

TGCTCAGGAGGCTGAAGCAGGAGGATCACTTGAGCCTGAGTTCAACACTGCAGTGAGCTA 
+ + + + + + 

3370 3390 3410 

TGGTGGCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAATAAAT 
+ + + + + + 

3430 3450 3470 

TTTAAAAAGACATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
+ + + ■ + + + 

3490 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIG. 30 



WO 01/78894 



PCT/US01/12245 



143/157 

10 30 50 

CGGGCACGGGTCGGCCGCAATCCAGCCTGGGCGGAGCCGGAGTTGCGAGCCGCTGCCTAG 

70 90 110 

AGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCCCGTTGC 
MetGlyTrpArgProArgArgAlaArgGlyThrProLeuL 

130 150 170 

TGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGAC 
euLeuLeuLeuLeuLeuLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGlyH 

190 210 230 

ATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACCG 
isIleProGlyGlnProValThrProHisTrpValLeuAspGlyGlnProTrpArgThrV 

2S0 270 290 

TCAGCCTGGAGGAGCCGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAAG 
alSerLeuGluGluProValSerLysProAspMetGlyLeuValAlalieuGluAlaGluG 

310 330 

GCCAGGAGCTCCTGCTTGAGCTGGAGAAGAACCACAGGC 
lyGlnGluLieuLeuLeuGluLeuGluljysAsnHisArg 
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10 30 50 

CGGGCACGGGTCGGCCGCAATCCAGCCTGGGCGGAGCCGGAGTTGCGAGCCGCTGCCTAG 

70 90 110 

AGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCCCGTTGC 
MetGlyTrpArgProArgArgAlaArgGlyThrProLeuL 

130 150 170 

TGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGAC 
euIieuLeuLeuLeuLeuLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGlyH 



ATATC CCTGGGCAG CC AGTC ACCCCGC ACTGGGTCCTGGATGGAC 
isIleProGlyGlnProValThrProHisTrpValLeuAspGly 
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10 30 SO 

GC C T AGAGGCC GAGGAGCT CACAGCT ATGGGCT GGAGGC CCCGGAGAGCTC GGGGGACCC 
MetGl yTrpArgProArgAirgAlaAxgGl yThrP 



70 90 110 

CGTTGCTGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTC 
roLeuLeuLeuLeuLeuLeuLeuLeuLeuLeuTrpProValProGlyAlaGlyVallieuG 



130 150 170 

AAGGACATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGC 
lnGl yHi s I leProGlyGlnProValThrProHi sTrpValLeuAspGl yGlnProTrpA 



190 210 230 

GCAC CGTCAGC CT GGAGGAGC C GGTC T CGAAGC CAGACATGGGGC TGGT GGCC CTGGAGG 
r gThrVal S er LeuGluGl uPr oVal S er Ly s P roAspMe t Gl yLeuValAl aLeuGl uA 



250 270 290 

CTGAAGGCCAGGAGCTCCTGCTTGAGCTGGAGAAGAACCACAGGCTGCTGGCCCCAGGAT 
1 aGluGl yGlnGluLeuLeuLeuGluLeuGluLysAsnHi sArgLeuLeuAl aProGl yT 



310 330 350 

ACATAGAAACC CACTAC GGCCCAGATGGGCAGCCAGTGGTGCT GGCCCCCAACCACACGG 
yrl leGluThrHi sTyrGlyProAspGlyGlnproValValLeuAlaProAsnHi sThrV 



370 390 410 

T GAGAT GCTTC CAT GGGCT CTGGGATGCACC GC CAGAGGATCAT TGC CACTAC CAAGGGC 
alArgCysPheHisGlyLeuTrpAspAlaProProGluAspHi sCysHi sTyrGlnGlyA 



430 450 470 

GAGTAAGGGGCTTCCCCGACTCCTGGGTAGTCCTCTGCACCTGCTCTGGGATGAGTGGCC 
rgValArgGlyPheProAspSerTrpValValLeuCysThrCysSerGlyMetSerGlyL 

490 510 530 

TGATCACCCTCAGCAGGAATGCCAGCTATTATCTGCGTCCCTGGCCACCCCGGGGCTCCA 
euIleThrLeuSerArgAsnAlaSerTyrTyrLeuArgProTrpProProArgGlySerli 

550 

AGGAC TT C T CAAC CCAC GAGAT 
y sAspPheS er ThrHi s Gl u 
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10 30 50 

GAGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCCCGTTG 
MetGlyTrpArgProArgArgAl aArgGlyThrProLeu 



70 90 110 

CTGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGA 
LeuLeuLeuLeuLeuLeuLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGly 



130 150 170 

CATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACC 
HislleProGlyGlnProValThrProHisTrpValLeuAspGlyGlnProTrpArgThr 



190 210 230 

GTCAGCCTGGAGGAGCCGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAA 
ValSerLeuGluGluProValSerLysProAspMetGlyLeuValAlaLeuGluAlaGlu 

250 270 290 

GGC C AGGAGCTCCTGCTTGAGCTGG AG AAGAAC CATGGCCTG ATC ACCCTCAGCAGGAAT 
GlyGl nGl uLeuLeuLeuGl uLeuG 1 uLy s AsnHi s G lyLeul 1 eThrL euS e r Arg As n 



310 330 350 

GCCAGCTATTATCTGCGTCCCTGGCCACCCCGGGGCTCCAAGGACTTCTCAACCCACGAG 
AlaSerTyrTyrLeuArgProTrpProProArgGlySerLysAspPheSerThrHisGlu 



AT 
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10 



GAGGCCGAGGAGCTCACAGCTATGGGCTGGAGGCCCCGGAGAGCTCGGGGGACCCCGTTG 
MetGlyTrpArgProArgArgAlaArgGlyThrProLeu 



CTGCTGCTGCTACTACTGCTGCTGCTCTGGCCAGTGCCAGGCGCCGGGGTGCTTCAAGGA 
LeuLeuLeuLeuLeuIjeuLeuLeuLeuTrpProValProGlyAlaGlyValLeuGlnGly 



CATATCCCTGGGCAGCCAGTCACCCCGCACTGGGTCCTGGATGGACAACCCTGGCGCACC 
HisIleProGlyGlnProValThrProHisTrpVaHieuAspGlyGlnProTrpArgThr 



GTCAGCCTGGAGGAGCCGGTCTCGAAGCCAGACATGGGGCTGGTGGCCCTGGAGGCTGAA 
ValSerLeuGluGluProValSerLysProAspMetGlyLeuValAlaLeuGluAlaGlu 



250 270 290 

GGC CAGGAGCTC C TGCTTGAGCTGGAGAAGAAC CAC AGGCTGC TGGC CC CAGGATACATA 
GlyGlnGluXieuLeuljeuGluLeuGluIjysAsnHisArgLeuIjeuAlaProGlyTyrlle 



310 330 350 

G AAACCC ACTACGGCC C AGATGGGC AGC C AGTGGTGCTGGC CCCC AACC AC ACGGATC AT 
G luThrH i sTy rG 1 y P r o As pG lyG InPro Va 1 Va lLeuAl aProAs nHi s Thr AspH i s 



TGCCACTACCAAGGGCGAGTAAGGGGCTTCCCCGACTCCTGGGTAGTCCTCTGCACCTGC 
CysHisTyrGlnGlyArgValArgGlyPheProAspSerTrpValValLeuCysThrCys 



430 450 470 

TCTGGGATGAGTGGCCTGATCACCCTCAGCAGGAATGCCAGCTATTATCTGCGTCCCTGG 
SerGlyMetSerGlyLeuIleThrLeuSerArgAsnAlaSerTyrTyrLeuArgProTrp 



CCAGCCCGGGGCTCCAAGGACTTCTCAACCCACGAGAT 
ProProArgGlySerLysAspPheSerThrHisGlu 
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10 30 50 

CTGGCCCCAGGATACATAGAAACCCACTACGGCCCAGATGGGCAGCCAGTGGTGCTGGCC 
LeuAlaProGlyTyrlleGluThrHisTyrGlyProAspGlyGlnProValValljeuAla 



70 90 no 

CCCAACCACACGGATCATTGCCACTACCAAGGGCGAGTAAGGGGCTTCCCCGACTCCTGG 
ProAsnHisThrAspHisCysHisTyrGlnGlyArgValArgGlyPheProAspSerTrp 



GTAGTCCTCTGCACCTGCTCTGGGATGAGTGGCCTGATCACCCTCAGCAGGAATGCCAGC 
ValValLeuCysThrCysSerGlyMetSerGlyLeuIleThrLeuSerArgAsnAlaSer 



190 210 230 

TATTATCTGCGTCCCTGGCCACCCCGGGGCTCCAAGGACTTCTCAACCCACGAGATCTTT 
TyrTyrLeuArgProTrpProProArgGlySerLysAspPheSerThrHisGluIlePhe 



CGGATGGAGCAGCTGCTCACCTGGAAAGGAACCTGTGGCCACAGGGATCCTGGGAACAAA 
ArgMetGluGlnLeuLeuThrTrpLysGlyThrCysGlyHisArgAspProGlyAsnLys 



310 330 350 

G 

GCGGGCATGACCAGCCTTCCTGGTGGTCCCCAGAGCAGGGGCAGGCGAAAAGCGCGCAGG 
AlaGlyMetThrSerLeuProGlyGlyProGlnSerArgGlyArgArgLysAlaArgArg 

Glu 

370 390 410 

ACCCGGAAGTACCTGGAACTGTACATTGTGGCAGACCACACCCTGTTCTTGACTCGGCAC 
ThrArgLy s TyrLeuGluLeuTyrI le Va 1 Al aAspHi s ThrLeuPheLeuThr ArgHi s 



430 450 470 

CGAAACTTGAACCACACCAAACAGCGTCTCCTGGAAGTCGCCAACTACGTGGACCAGCTT 
ArgAsnLeuAsnHisThrLysGlnArgLeuLeuGluValAlaAsnTyrValAspGlnLeu 



CTCAGGACTCTGGACATTCAGGTGGC 
LeuArgThrLeviAspIleGlnVal 



FIG. 36 
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10 30 50 

CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrLeuGluGlnGlnCysGlnGlnLeuTr 



70 90 110 

GGGGCCTGGCTCCCACCCAGCTCCCGAGGCCTGTTTCCAGGTGGTGAACTCTGCGGGAGA 
pGlyProGlySerHisProAlaProGluAlaCysPheGlnValValAsnSerAlaGlyAs 

130 150 170 

TGCTCATGGAAACTGCGGCCAGGACAGCGAGGGCCACTTCCTGCCCTGTGCAGGGAGGGA 
pAlaHisGlyAsnCysGlyGlnAspSerGluGlyHisPheLeuProCysAlaGlyArgAs 

190 210 230 

TGCCCTGTGTGGGAAGCTGCAGTGCCAGGGTGGAAAGCCCAGCCTGCTC'GCACCGCACAT 
pAlaLeuCysGlyLysLeuGlnCysGlnGlyGlyLysProSerLeuLeuAlaProHisMe 

250 270 290 

GGTGCCAGTGGACTCTACCGTTCACCTAGATGGCCAGGAAGTGACTTGTCGGGGAGCCTT 
tValProValAspSerThrValHisLeuAspGlyGlnGluValThrCysArgGlyAlaLe 

310 330 350 

GGCACTCCCCAGTGCCCAGCTGGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCA 
uAlaLeuProSerAlaGlnLeuAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGl 

370 390 410 

GT GT GGACCTAGAATGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGC 
nCysGlyProArgMetValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAl 

430 450 470 

TCCACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGC 
aProProPheCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAl 



490 

TGAAAAC CAT GACACCTTCCT GC 
aGluAsnHisAspThrPheLeu 



FIG. 37 
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10 30 50 

CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCT GGAGCAGCAGTGCCAGC AGCT CT G 
S e rGl yTyr C ys TrpAspGl yAl a Cy s P roThrLeuGluGlnGlnC ys GlnGlriLe uT r 



70 90 110 

GGGGCCTGGCTCCCACCCAGC TCCC GAGGCCTGTT TCCAGGTGGTGAACTCTGCGGGAGA 
pGlyProGlySerHisProAlaProGluAlaCysPheGlnValValAsnSerAlaGlyAs 



130 150 170 

TGCTCATGGAAAC T GC GGC CAGGACAGCGAGGGC CACTT C CTGCCCT GT GCAGGGAGGGA 
pAlaHi sGlyAsnC y sGL yGlnAspSe rGluGl yHi sPheLeuPr oCysAl aGlyAxgAs 



190 210 230 

TGCCCTGTGTGGGAAGCTGCAGTGCCAGGGTGGAAAGCCCAGCCTGCTCGCACCGCACAT 
pAl aLeuCysGl yLy sLeuGlnC ysGlnGlyGl yLysPr oSerLeuLeuAlaProHi sMe 



250 270 290 

GGTGCCAGTGGACTCTACCGTTCACCTAGATGGCCAGGAAGTGACTTGTCGGGGAGCCTT 
tValProValAapS erThrValHisLeuAspGlyGlnGluValThrCysArgGl yAlaLe 



uAl aLeuPr oS er Al aGlnLeuAspLeuLeuGl yLeuGl yLeuVa lGluPr oGl yThrGl 



370 390 410 

GTGTGGACC TAGAATGGT GTGC C AGAGCAGGC GCT GCAGGAAGAAT GCC T T C CAGGAGC T 
nC y sGl y Pr oArgMe tVal Cy sGlnS e rAr gAr gC y sAr gLys AsnAl aPheGlnGluLe 



430 450 470 

TCAGCGCTGCCTGACTGCCTGCCACAGCCACGGGGTTTGCAATAGCAACCATAACTGCCA 
uGlnAr gCy sLeuThrAl a C y sHi s s erHi a Gl yValC ysAsnSe rAsnHi aAanC ysHi 



490 510 530 

CTGTGCTCCAGGCTGGGCTCCACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGA 
sCyaAlaProGlyTrpAlaProProPheCysAspLysProGlyPheGlyGlySexMetAs 



CAGTGGCCCTGTGCAGGCTGAAAACCATGACACCTTCCTGC 
pSerGlyProValGlnAlaGluAsnHisAspThrPheLeu 



FIG. 38 
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10 30 50 

CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrLeuGluGlnGlnCysGlnGlnLeuTr 

70 90 110 

GGGGCCTGATGGCCAGGAA.GTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCT 
pGlyProAspGlyGlnGluValThrCysArgGlyAlaLeuAlaLeuProSerAlaGlnLe 

130 150 170 

GGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGTG 
•uAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGlnCysGlyProArgMetValCy 

190 210 230 

CCAGAGCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTG 
sGlnSerArgArgCysArgLysAsnAlaPheGlnGluLeuGlnArgCysLeuThrAlaCy 

250 270 290 

CCACAGCCACGGGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGCTCC 
sHisSerHisGlyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaPr 

310 330 350 

ACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGA 
oProPheCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGl 

370 

AAACCATGACACCTTCCTGC 
uAsnHisAspThrPheLeu 



FIG. 39 
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10 30 50 

GGCCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTGGGGCTGCAG 
AlaTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTrpGlyCysAr 



70 90 HO 

AAGGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCACCCCCTGGGCGG 
gArgAspProAlaCysSerGlyProLysAspGlyProHisArgAspHisProLeuGlyGl 



130 150 170 

CGTTCACCCCATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCTGGACCCTGA 
yValHisProMetGluLeuGlyProThrAlaThrGlyGlnProTrpProLeuAspProGl 



190 210 230 

GAACTCTCATGAGCCCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTCGCCTGACCC 
uAsnSerHisGluProSerSerHisProGluLysProLeuProAlaValSerProAspPr 



250 270 290 

CCAAGCAGATCAAGTCCAGATGCCAAGATCCTGCCTCTGGTGAGAGGTAGCTCCTAAAAT 
oGlnAlaAspGlnValGlnMetProArgSerCysLeuTrpEnd 



310 

GAACAGATTTAAAGACAGGTGGCC 



FIG. 40 
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CftGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrLeuGluGlnGlnCysGlnGlnLeuTr 



GGGGCCTGATGGCCAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCT 
pGlyProAspGlyGlnGluValThrCysArgGlyAlaLeuAlaLeuProSerAlaGlnLe 



GGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGTG 
uAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGlnCysGlyProArgMetValCy 



CCAGAGCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTG 
sGlnSerArgArgCysArgLysAsnAlaPheGlnGluLeuGlnArgCysLeuThrAlaCy 



CCACAGCCACGGGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGCTCC 
sHisSerHisGlyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaPr 



ACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGA 
oProPheCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGl 



AAACCATGACACCTTCCTGCTGGCCATGCTCCTCAGCGTCCTGCTGCCTCTGCTCCCAGG 
uAsnHisAspThrPheLeuLeuAlaMetLeuLeuSerValLeuLeuProLeuLeuProGl 



GGCCGGCCTGGCCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTG 
yAlaGlyLeuAlaTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTr 



GGGCTGCAGAAGGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCACCC 
pGlyCysArgArgAspProAlaCysSerGlyProLysAspGlyProHisArgAspHisPr 



oLeuGlyGlyValHisProMetGl uLeuGlyProThrAlaTbrGlyGlnProTrpProLe 

610 630 650 

GGACCCrGAGAACTCTCATGAGCOCAGCAGCCACCCTGAGAAGCCTCTGCCAGCAGTCTC 
uAspProGluAsnSerHisGluProSerSerHisProGluLysProLeuProAlaValSe 



GCCTGACOCCCAAGCAGATCAAGTCCAGATGCCAAGATCCTGCCTCTGGTGAGAGGTAGC 
rProAspProGlnAlaAspGlnVaXGlnMetProArgSerCysLeuTrpEnd 

730 750 

T CCTAAAATGAACAGATTTAAAGACAGGTGGCC 



FIG. 41 
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10 30 50 

CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrLe-uGluGlnGlnCysGlnGlnLeuTr 

70 90 110 

GGGGCCTGGCTCCCACCCAGCTCCCGAGGCCTGTTTCCAGGTGGTGAA.CTCTGCGGGAGA 
pGlyProGlySerHisProAlaProGluAlaCysPheGlnValValAsnSerAlaGlyAs 

130 150 170 

TGCTCATGGAAACTGCGGCCAGGACAGCGAGGGCCACTTCCTGCCCTGTGCAGGGAGGGA 
pAlaHisGlyAsnCysGlyGlnAspSerGluGlyHisPheLeuProCysAlaGlyArgAs 

190 210 230 

TGCCCTGTGTGGGAAGCTGCAGTGCCAGGGTGGAAAGCCCAGCCTGCTCGCACCGCACAT 
pAlaLeuCysGlyLysLeuGlnCysGlnGlyGlyLysProSerLeuLeuAlaProHisMe 

250 270 290 

GGT GCCAGTGGACT CTACC GTTCACCTAGATGGCCAGGAAGT GACTT GT C GGGGAGCCTT 
tValProValAspSerThrValHisLeuAspGlyGlnGluValThrCysArgGlyAlaLe 

310 330 350 

■ •»••• 
GGCACTCCCCAGTGCCCAGCTGGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCA 
uAlaLeuProSerAlaGlnLeuAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGl 

370 390 410 

GTGTGGACCTAGAATGGTGTGCCAGAGCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCT 
nCysGlyProArgMetValCysGlnSerArgArgCysArgLysAsnAlaPheGlnGluLe 

430 450 470 

TCAGCGCTGCCTGACTGCCTGCCACAGCCACGGGGTTTGCAATAGCAACCATAACTGCCA 
uGlnArgCysLeuThrAlaCysHisSerHisGlyValCysAsnSerAsnHisAsnCysHi 

490 510 530 

CTGTGCTCCAGGCTGGGCTCCACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGA 
sCysAlaProGlyTrpAlaProProPheCysAspLysProGlyPheGlyGlySerMetAs 

550 570 590 

CAGTGGCCCTGTGCAGGCTGAAAACCATGACACCTTCCTGCTGGCCATGCTCCTCAGCGT 
pSerGlyProValGlnAlaGluAsnHisAspThrPheLeuLeuAlaMetLeuLeuSerVa 

610 630 650 

CCTGCTGCCTCTGCTCCCAGGGGCCGGCCTGGCCTGGTGTTGCTACCGACTCCCAGGAGC 
iLeuLeuProLeuLeuProGlyAlaGlyLeuAlaTrpCysCysTyrArgLeuProGlyAl 

670 690 710 
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CCATCTGCAGCGATGCAGCTGGGGCTGCAGAAGGGACCCTGCGTGCAGTGGCCCCAAAGA 
aHisLeuGlnArgCysSerTrpGlyCysArgArgAspProAlaCysSerGlyProLysAs 

730 750 770 

TGGCCCACACAGGGACCACCCCCTGGGCGGCGTTCACCCCATGGAGTTGGGCCCCACAGC 
pGlyProHisArgAspHisProLeuGlyGlyValHisProMetGluLeuGlyProThrAl 

790 610 830 

CACTGGACAGCCCTGGCCCCTGGACCCTGAGAACTCTCATGAGCCCAGCAGCCACCCTGA 
aThrGlyGlnProTrpProLeuAspProGluAsnSerHisGluProSerSerHisProGl 

850 870 890 

GAAGCCT CTGCCAGCAGTCTCGCCTGACCCCCAAGAT CAAGT CC AGAT GC CAAGATCCTG 
uLysProLeuProAlaValSerProAspProGlnAspGlnValGlnMetProArgSerCy 

910 930 950 

CCTCTGGTGAGAGGTAGCTCCTAAAATGAACAGATTTAAAGACAGGTGGCCACTGACAGC 
sLeuTrpEnd 

970 990 ' 1010 

CACTCCAGGAACTTGAACTGCAGGGGCAGAGCCAGTGAATCACCGGACCTCCAGCACCTG 

•1030 1050 1070 

CAGGCAGCTTGGAAGTTTCTTCCCCGAGTGGAGCTTCGACCCACCCACTCCAGGAACCCA 

1090 1110 1130 

GAGCCACATTAGAAGTTCCTGAGGGCTGGAGAACACTGCTGGGCACACTCTCCAGCTCAA 

1150 
TAAACCATCAGTCC 

rt772 
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10 30 50 

CAGTGGCTACTGCTGGGATGGCGCATGTCCCACGCTGGAGCAGCAGTGCCAGCAGCTCTG 
SerGlyTyrCysTrpAspGlyAlaCysProThrLeuGluGlnGlnCysGlnGlnLeuTr 



70 90 110 

GGGGCCTGATGGCCAGGAAGTGACTTGTCGGGGAGCCTTGGCACTCCCCAGTGCCCAGCT 
pGlyProAspGlyGlnGluValThrCysArgGlyAlaLeuAlaLeuProSerAlaGlnLe 



130 150 - 170 

GGACCTGCTTGGCCTGGGCCTGGTAGAGCCAGGCACCCAGTGTGGACCTAGAATGGTGTG 
uAspLeuLeuGlyLeuGlyLeuValGluProGlyThrGlnCysGlyProArgMetValCy 



190 210 • 230 

CCAGAGCAGGCGCTGCAGGAAGAATGCCTTCCAGGAGCTTCAGCGCTGCCTGACTGCCTG 
sGlnSerArgArgCysArgLysAsnAlaPheGlnGluLeuGlnArgCysLeuThrAlaCy 



250 270 290 

CCACAGCCACGGGGTTTGCAATAGCAACCATAACTGCCACTGTGCTCCAGGCTGGGCTCC 
sHisSerHisGlyValCysAsnSerAsnHisAsnCysHisCysAlaProGlyTrpAlaPr 



310 330 350 

ACCCTTCTGTGACAAGCCAGGCTTTGGTGGCAGCATGGACAGTGGCCCTGTGCAGGCTGA 
0ProPheCysAspLysProGlyPheGlyGlySerMetAspSerGlyProValGlnAlaGl 



370 390 410 

AAACCATGACACCTTCCTGCTGGCCATGCTCCTCAGCGTCCTGCTGCCTCTGCTCCCAGG 
uAsnHisAspThrPheLeuLeuAlaMetLeuLeuSerValLeuLeuProLeuLeuProGl 



430 450 470 

GGCCGGCCTGGCCTGGTGTTGCTACCGACTCCCAGGAGCCCATCTGCAGCGATGCAGCTG 
yAlaGlyLeuAlaTrpCysCysTyrArgLeuProGlyAlaHisLeuGlnArgCysSerTr 



490 510 530 

GGGCTGCAGAAGGGACCCTGCGTGCAGTGGCCCCAAAGATGGCCCACACAGGGACCACCC 
pGlyCysArgArgAspProAlaCysSerGlyProLysAspGlyProHisArgAspHisPr 



550 570 590 
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CCTGGGCGGCGTTCACCCCATGGAGTTGGGCCCCACAGCCACTGGACAGCCCTGGCCCCT 
oLeuGlyGlyV^lHisProMetGluLeuGlyProThrAlaThrGlyGlnProTrpProLe 

610 630 650 

G GACC CTGAGAACT CT CAT GAGC C CAGC AGC CACCCT GAGAAGC CT CT GC CAGCAGT CT C 
uAspProGluAsnSerHisGluProSerSerHisProGluLysProLeuProAlaValSe 

670 690 710 

GCCTGACCCCCAAGATCAAGTCCAGATGCCAAGArCCTGCCrCTGGTGAGAGGTAGCTCC 
rProAspProGlnAspGlnValGlnMetProArgSerCysLeuTrpEnd 

730 750 770 

TAAAAT GAACAGATTTAAAGACAG GT GGCCACTGACAGCCACTC CAG G AACTT GAACTGC 

790 810 830 

AGGGGCAGAGCCAGTGAATCACCGGACCTCCAGCACCTGCAGGCAGCTTGGAAGTTTCTT 

850 870 890 

CCCCGAGTGGAGCTTCGACCCACCCACTCCAGGAACCCAGAGCCACATTAGAAGTTCCTG 

910 930 950 

AGGGCTGGAGAACACTGCTGGGCACACTCTCCAGCTCAATAAACCATCAGTCC 
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SEQUENCE LISTING 
<110> GENOME THERAPEUTICS CORPORATION 



<120> NOVEL HUMAN GENE RELATING TO RESPIRATORY DISEASES AND 
OBESITY 



<130> 2976-4039PCT 



<140> 
<141> 



<160> 363 



<17 0> Patent In Ver. 2.1 



<210> 1 
<211> 3626 
<212> DNA 

<213> Homo sapiens 



<400> 1 

cgggcacggg tcggccgcaa tccagcctgg gcggagccgg agttgcgagc cgctgcctag 60 
aggccgagga gctcacagct atgggctgga ggccccggag agctcggggg accccgttgc 12 0 
tgctgctgct actactgctg ctgctctggc cagtgccagg cgccggggtg cttcaaggac 180 
atatccctgg gcagccagtc accccgcact gggtcctgga tggacaaccc tggcgcaccg 24 0 
tcagcctgga ggagccggtc tcgaagccag acatggggct ggtggccctg gaggctgaag 3 00 
gccaggagct cctgcttgag ctggagaaga accacaggct gctggcccca ggatacatag 360 
aaacccacta cggcccagat gggcagccag tggtgctggc ccccaaccac acggtgagat 42 0 
gcttccatgg gctctgggat gcaccgccag aggatcattg ccactaccaa gggcgagtaa 480 
ggggcttccc cgactcctgg gtagtcctct gcacctgctc tgggatgagt ggcctgatca 54 0 
ccctcagcag gaatgccagc tattatctgc gtccctggcc accccggggc tccaaggact 600 
tctcaaccca cgagatcttt cggatggagc agctgctcac ctggaaagga acctgtggcc 660 
acagggatcc tgggaacaaa gcgggcatga ccagccttcc tggtggtccc cagagcaggg 72 0 
gcaggcgaga agcgcgcagg acccggaagt acctggaact gtacattgtg gcagaccaca 780 
ccctgttctt gactcggcac cgaaacttga accacaccaa acagcgtctc ctggaagtcg 840 
ccaactacgt ggaccagctt ctcaggactc tggacattca ggtggcgctg accggcctgg 900 
aggtgtggac cgagcgggac cgcagccgcg tcacgcagga cgccaacgcc acgctctggg 960 
ccttcctgca gtggcgccgg gggctgtggg cgcagcggcc ccacgactcc gcgcagctgc 102 0 
tcacgggccg cgccttccag ggcgccacag tgggcctggc gcccgfccgag ggcatgtgcc 1080 
gcgccgagag ctcgggaggc gtgagcacgg accactcgga gctccccatc ggcgccgcag 114 0 
ccaccatggc ccatgagatc ggccacagcc tcggcctcag ccacgacccc gacggctgct 1200 
gcgtggaggc tgcggccgag tccggaggct gcgtcatggc tgcggccacc gggcacccgt 12 60 
ttccgcgcgt gttcagcgcc tgcagccgcc gccagctgcg cgccttcttc cgcaaggggg 132 0 
gcggcgcttg cctctccaat gccccggacc ccggactccc ggtgccgccg gcgctctgcg 13 80 
ggaacggctt cgtggaagcg ggcgaggagt gtgactgcgg ccctggccag gagtgccgcg 1440 
acctctgctg ctttgctcac aactgctcgc tgcgcccggg ggcccagtgc gcccacgggg 1500 
actgctgcgt gcgctgcctg ctgaagccgg ctggagcgct gtgccgccag gccatgggtg 1560 
actgtgacct ccctgagttt tgcacgggca cctcctccca ctgtccccca gacgtttacc 1620 
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tactggacgg ctcaccctgt gccaggggca 
cgctggagca gcagtgccag cagctctggg 
gtttccaggt ggtgaactct gcgggagatg 
gccacttcct gccctgtgca gggagggatg 
gaaagcccag cctgctcgca ccgcacatgg 
gccaggaagt gacttgtcgg ggagccttgg 
gcctgggcct ggtagagcca ggcacccagt 
gctgcaggaa gaatgccttc caggagcttc 
gggtttgcaa tagcaaccat aactgccact 
acaagccagg ctttggtggc agcatggaca 
ccttcctgct ggccatgctc ctcagcgtcc 
cctggtgttg ctaccgactc ccaggagccc 
gggaccctgc gtgcagtggc cccaaagatg 
ttcaccccat ggagttgggc cccacagcca 
actctcatga gcccagcagc caccctgaga 
aagcagatca agtccagatg ccaagatcct 
acagatttaa agacaggtgg ccactgacag 
agccagtgaa tcaccggacc tccagcacct 
ggagcttcga cccacccact ccaggaaccc 
agaacactgc tgggcacact ctccagctca 
cacagcccct gacctccctc accagtggag 
tctgtcctgg gagtctggtg tgtctcctac 
gggcatgact gctggccaga agctagtggt 
actcccctgc agcctggctg gcctctgcaa 
gtttcttccc accctgtctt ctcccctagg 
tgctacacct gaggttctgg agctcagaat 
cggggggaca gagggaacca tttaagaaaa 
tgttggctat aggcgtggtg gctcatgcct 
ggaggatcac cagaggccag caggtccaca 
tctacagaaa aattttaaaa ttagctgggc 
tcaggaggct gaagcaggag gatcacttga 
tggcaccact gcactccagc ctgggtgaca 
aaaaagacat aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaa 



2/154 

gtggctactg ctgggatggc gcatgtccca 1680 
ggcctggctc ccacccagct cccgaggcct 1740 
ctcatggaaa ctgcggccag gacagcgagg 180 0 
ccctgtgtgg gaagctgcag tgccagggtg 1860 
tgccagtgga ctctaccgtt cacctagatg 1920 
cactccccag tgcccagctg gacctgcttg 1980 
gtggacctag aatggtgtgc cagagcaggc 2040 
agcgctgcct gactgcctgc cacagccacg 2100 
gtgctccagg ctgggctcca cccttctgtg 2160 
gtggccctgt gcaggctgaa aaccatgaca 222 0 
tgctgcctct gctcccaggg gccggcctgg 2280 
atctgcagcg atgcagctgg ggctgcagaa 234 0 
gcccacacag ggaccacccc ctgggcggcg 2400 
ctggacagcc ctggcccctg gaccctgaga 24 6 0 
agcctctgcc agcagtctcg cctgaccccc 2 520 
gcctctggtg agaggtagct cctaaaatga 258 0 
ccactccagg aacttgaact gcaggggcag 2640 
gcaggcagct tggaagtttc ttccccgagt 2 700 
agagccacat tagaagttcc tgagggctgg 276 0 
ataaaccatc agtcccagaa gcaaaggtca 2 82 0 
gctgggtagt gctggccatc ccaaaagggc 2880 
atgcaatttc cacggaccca gctctgtgga 294 0 
cctggggccc tatggttcga ctgagtccac 3 000 
acaaacataa ttttggggac cttccttcct 3060 
tggttcctga gcccccaccc ccaatcccag 312 0 
ctgacagcct ctcccccatt ctgtgtgtgt 318 0 
gataccaaag tagaagtcaa aagaaagaca 324 0 
ataatcccag cactttggga agccggggta 3300 
ccagcctggg caacacagca agacaccgca 3360 
gtggtggtgt gtacctgtag gcctagctgc 342 0 
gcctgagttc aacactgcag tgagctatgg 3480 
gagcaagacc ctgtctctaa aataaatttt 3540 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3600 

3626 



<210> 2 

<211> 227 

<212> DNA 

<213> Homo sapiens 

<400> 2 

accgggcacg ggtcggccgc aatccagcct 
agaggccgag gagctcacag ctatgggctg 
gctgctgctg ctactactgc tgctgctctg 
acatatccct gggcagccag tcaccccgca 



gggcggagcc ggagttgcga gccgctgcct 60 
gaggccccgg agagctcggg ggaccccgtt 120 
gccagtgcca ggcgccgggg tgcttcaagg 180 
ctgggtcctg gatggac 227 



<210> 3 

<211> 3509 

<212> DNA 

<213> Homo sapiens 

<400> 3 

cagctatggg ctggaggccc cggagagctc gggggacccc gttgctgctg ctgctactac 60 

tgctgctgct ctggccagtg ccaggcgccg gggtgcttca aggacatatc cctgggcagc 120 

cagtcacccc gcactgggtc ctggatggac aaccctggcg caccgtcagc ctggaggagc 18 0 

cggtctcgaa gccagacatg gggctggtgg ccctggaggc tgaaggccag gagctcctgc 24 0 
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ttgagctgga gaagaaccac aggctgctgg 
cagatgggca gccagtggtg ctggccccca 
gagtaagggg cttccccgac tcctgggtag 
tgatcaccct cagcaggaat gccagctatt 
aggacttctc aacccacgag atctttcgga 
gtggccacag ggatcctggg aacaaagcgg 
gcaggggcag gcgagaagcg cgcaggaccc 
accacaccct gttcttgact cggcaccgaa 
aagtcgccaa ctacgtggac cagcttctca 
gcctggaggt gtggaccgag cgggaccgca 
tctgggcctt cctgcagtgg cgccgggggc 
agctgctcac gggccgcgcc ttccagggcg 
tgtgccgcgc cgagagctcg ggaggcgtga 
ccgcagccac catggcccat gagatcggcc 
gctgctgcgt ggaggctgcg gccgagtccg 
acccgtttcc gcgcgtgttc agcgcctgca 
aggggggcgg cgcttgcctc tccaatgccc 
tctgcgggaa cggcttcgtg gaagcgggcg 
gccgcgacct ctgctgcttt gctcacaact 
acggggactg ctgcgtgcgc tgcctgctga 
tgggtgactg tgacctccct gagttttgca 
tttacctact ggacggctca ccctgtgcca 
gtcccacgct ggagcagcag tgccagcagc 
aggcctgttt ccaggtggtg aactctgcgg 
gcgagggcca cttcctgccc tgtgcaggga 
agggtggaaa gcccagcctg ctcgcaccgc 
tagatggcca ggaagtgact tgtcggggag 
tgcttggcct gggcctggta gagccaggca 
gcaggcgctg caggaagaat gccttccagg 
gccacggggt ttgcaatagc aaccataact 
tctgtgacaa gccaggcttt ggtggcagca 
atgacacctt cctgctggcc atgctcctca 
gcctggcctg gtgttgctac cgactcccag 
gcagaaggga ccctgcgtgc agtggcccca 
gcggcgttca ccccatggag ttgggcccca 
ctgagaactc tcatgagccc agcagccacc 
acccccaaga tcaagtccag atgccaagat 
tgaacagatt taaagacagg tggccactga 
cagagccagt gaatcaccgg acctccagca 
agtggagctt cgacccaccc actccaggaa 
tggagaacac tgctgggcac actctccagc 
tcacacagcc cctgacctcc ctcaccagtg 
ggctctgtcc tgggagtctg gtgtgtctcc 
ggagggcatg actgctggcc agaagctagt 
cacactcccc tgcagcctgg ctggcctctg 
cctgtttctt cccaccctgt cttctcccct 
cagtgctaca cctgaggttc tggagctcag 
tgtcgggggg acagagggaa ccatttaaga 
acatgttggc tataggcgtg gtggctcatg 
gtaggaggat caccagaggc cagcaggtcc 
gcatctacag aaaaatttta aaattagctg 
tgctcaggag gctgaagcag gaggatcact 
tggtggcacc actgcactcc agcctgggtg 
tttaaaaaga cataaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 



ccccaggata catagaaacc cactacggcc 300 
accacacgga tcattgccac taccaagggc 360" 
tcctctgcac ctgctctggg atgagtggcc 420 
atctgcgtcc ctggccaccc cggggctcca 480 
tggagcagct gctcacctgg aaaggaacct 54 0 
gcatgaccag ccttcctggt ggtccccaga 600 
ggaagtacct ggaactgtac attgtggcag 660 
acttgaacca caccaaacag cgtctcctgg 72 0 
ggactctgga cattcaggtg gcgctgaccg 780 
gccgcgtcac gcaggacgcc aacgccacgc 840 
tgtgggcgca gcggccccac gactccgcgc 900 
ccacagtggg cctggcgccc gtcgagggca 960 
gcacggacca ctcggagctc cccatcggcg 102 0 
acagcctcgg cctcagccac gaccccgacg 1080 
gaggctgcgt catggctgcg gccaccgggc 1140 
gccgccgcca gctgcgcgcc ttcttccgca 12 00 
cggaccccgg actcccggtg ccgccggcgc 1260 
aggagtgtga ctgcggccct ggccaggagt 1320 
gctcgctgcg cccgggggcc cagtgcgccc 13 80 
agccggctgg agcgctgtgc cgccaggcca 144 0 
cgggcacctc ctcccactgt cccccagacg 1500 
ggggcagtgg ctactgctgg gatggcgcat 15 6 0 
tctgggggcc tggctcccac ccagctcccg 162 0 
gagatgctca tggaaactgc ggccaggaca 168 0 
gggatgccct gtgtgggaag ctgcagtgcc 174 0 
acatggtgcc agtggactct accgttcacc 1800 
ccttggcact ccccagtgcc cagctggacc 1860 
cccagtgtgg acctagaatg gtgtgccaga 1920 
agcttcagcg ctgcctgact gcctgccaca 1980 
gccactgtgc tccaggctgg gctccaccct 2040 
tggacagtgg ccctgtgcag gctgaaaacc 2100 
gcgtcctgct gcctctgctc ccaggggccg 216 0 
gagcccatct gcagcgatgc agctggggct 222 0 
aagatggccc acacagggac caccccctgg 2280 
cagccactgg acagccctgg cccctggacc 2340 
ctgagaagcc tctgccagca gtctcgcctg 2400 
cctgcctctg gtgagaggta gctcctaaaa 2460 
cagccactcc aggaacttga actgcagggg 2520 
cc tgcaggca gcttggaagt ttcttccccg 2580 
cccagagcca cattagaagt tcctgagggc 2 64 0 
tcaataaacc atcagtccca gaagcaaagg 2 700 
gaggctgggt agtgctggcc atcccaaaag 27 60 
tacatgcaat ttccacggac ccagctctgt 2 82 0 
ggtcctgggg ccctatggtt cgactgagtc 2880 
caaacaaaca taattttggg gaccttcctt 294 0 
aggtggttcc tgagccccca cccccaatcc 3 00 0 
aatctgacag cctctccccc attctgtgtg 3060 
aaagatacca aagtagaagt caaaagaaag 312 0 
cctataatcc cagcactttg ggaagccggg 3180 
acaccagcct gggcaacaca gcaagacacc 3240 
ggcgtggtgg tgtgtacctg taggcctagc 33 00 
tgagcctgag ttcaacactg cagtgagcta 3360 
acagagcaag accctgtctc taaaataaat 3420 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3480 

3509 



<210> 4 
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<211> 826 
<212> PRT 
<213> Homo sapiens 



<400> 4 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 



Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg Leu Leu Ala Pro Gly Tyr lie Glu Thr His 
85 90 95 

Tyr Gly Pro Asp Gly Gin Pro Val Val Leu Ala Pro Asn His Thr Val 
100 105 110 

Arg Cys Phe His Gly Leu Trp Asp Ala Pro Pro Glu Asp His Cys His 
115 120 125 

Tyr Gin Gly Arg Val Arg Gly Phe Pro Asp Ser Trp Val Val Leu Cys 
130 135 140 

Thr Cys Ser Gly Met Ser Gly Leu lie Thr Leu Ser Arg Asn Ala Ser 
145 150 155 160 

Tyr Tyr Leu Arg Pro Trp Pro Pro Arg Gly Ser Lys Asp Phe Ser Thr 
165 170 175 

His Glu lie Phe Arg Met Glu Gin Leu Leu Thr Trp Lys Gly Thr Cys 
180 185 190 

Gly His Arg Asp Pro Gly Asn Lys Ala Gly Met Thr Ser Leu Pro Gly 
195 200 205 

Gly Pro Gin Ser Arg Gly Arg Arg Glu Ala Arg Arg Thr Arg Lys Tyr 
210 215 220 

Leu Glu Leu Tyr He Val Ala Asp His Thr Leu Phe Leu Thr Arg His 
225 230 235 240 

Arg Asn Leu Asn His Thr Lys Gin Arg Leu Leu Glu Val Ala Asn Tyr 
245 250 255 



Val Asp Gin Leu Leu Arg Thr Leu 
260 

Leu Glu Val Trp Thr Glu Arg Asp 
275 280 



Asp He Gin Val Ala Leu Thr Gly 
265 270 

Arg Ser Arg Val Thr Gin Asp Ala 
285 
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Asn Ala Thr Leu Trp Ala Phe Leu Gin Trp Arg Arg Gly Leu Trp Ala 
290 295 300 

Gin Arg Pro His Asp Ser Ala Gin Leu Leu Thr Gly Arg Ala Phe Gin 
305 310 315 320 

Gly Ala Thr Val Gly Leu Ala Pro Val Glu Gly Met Cys Arg Ala Glu 
325 330 335 

Ser Ser Gly Gly Val Ser Thr Asp His Ser Glu Leu Pro lie Gly Ala 
340 345 350 

Ala Ala Thr Met Ala His Glu lie Gly His Ser Leu Gly Leu Ser His 
355 360 365 

Asp Pro Asp Gly Cys Cys Val Glu Ala Ala Ala Glu Ser Gly Gly Cys 
370 375 380 

Val Met Ala Ala Ala Thr Gly His Pro Phe Pro Arg Val Phe Ser Ala 
385 390 395 400 

Cys Ser Arg Arg Gin Leu Arg Ala Phe Phe Arg Lys Gly Gly Gly Ala 
405 410 415 

Cys Leu Ser Asn Ala Pro Asp Pro Gly Leu Pro Val Pro Pro Ala Leu 
420 425 430 

Cys Gly Asn Gly Phe Val Glu Ala Gly Glu Glu Cys Asp Cys Gly Pro 
435 440 445 

Gly Gin Glu Cys Arg Asp Leu Cys Cys Phe Ala His Asn Cys Ser Leu 
450 455 460 

Arg Pro Gly Ala Gin Cys Ala His Gly Asp Cys Cys Val Arg Cys Leu 
465 470 475 480 

Leu Lys Pro Ala Gly Ala Leu Cys Arg Gin Ala Met Gly Asp Cys Asp 
485 490 495 

Leu Pro Glu Phe Cys Thr Gly Thr Ser Ser His Cys Pro Pro Asp Val 
500 505 510 

Tyr Leu Leu Asp Gly Ser Pro Cys Ala Arg Gly Ser Gly Tyr Cys Trp 
515 520 525 

Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys Gin Gin Leu Trp Gly 
530 535 540 

Pro Gly Ser His Pro Ala Pro Glu Ala Cys Phe Gin Val Val Asn Ser 
545 550 555 560 

Ala Gly Asp Ala His Gly Asn Cys Gly Gin Asp Ser Glu Gly His Phe 
565 570 575 

Leu Pro Cys Ala Gly Arg Asp Ala Leu Cys Gly Lys Leu Gin Cys Gin 
580 585 590 
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Gly Gly Lys Pro Ser Leu Leu Ala Pro His Met Val Pro Val Asp Ser 
595 600 605 

Thr Val His Leu Asp Gly Gin Glu Val Thr Cys Arg Gly Ala Leu Ala 
610 615 620 

Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu Gly Leu Val Glu Pro 
625 630 635 640 

Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin Ser Arg Arg Cys Arg 
645 650 655 

Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu Thr Ala Cys His Ser 
660 665 670 

His Gly Val Cys Asn Ser Asn His Asn Cys His Cys Ala Pro Gly Trp 
675 680 685 

Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly Gly Ser Met Asp Ser 
690 695 700 

Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe Leu Leu Ala Met Leu 
705 710 715 720 

Leu Ser Val Leu Leu Pro Leu Leu Pro Gly Ala Gly Leu Ala Trp Cys 
725 730 735 

Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg Cys Ser Trp Gly Cys 
740 745 750 

Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp Gly Pro His Arg Asp 
755 760 765 

His Pro Leu Gly Gly Val His Pro Met Glu Leu Gly Pro Thr Ala Thr 
770 775 780 

Gly Gin Pro Trp Pro Leu Asp Pro Glu Asn Ser His Glu Pro Ser Ser 
785 790 795 800 

His Pro Glu Lys Pro Leu Pro Ala Val Ser Pro Asp Pro Gin Ala Asp 
805 810 815 

Gin Val Gin Met Pro Arg Ser Cys Leu Trp 
820 825 



<210> 5 

<211> 207433 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gctctaataa atttgcggcc gctaatacga ctcactatag ggagaggatc cgcggaattc 60 

ccccatgtgc catgtccacg agatggttga acagatgaga caacatggtt gtgcagcttc 120 

tctctttttt tttttttcag acagagtctc actctgtagc ccaggctgga gtgcagtggc 18 0 

gcaatttcag ctcactgcaa cctccgcctc ccaggtcctg attcaagcag ttctcctgcc 240 

tcagcctcct gagtagctgg gattacagga acgcgccact atgcccagct aatttttcta 3 00 

ggtttagtag agacggggtt tcaccatgtt gaccaggctg gtcttgaatt cctgaccttg 3 60 
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tgatccgccc gcctcggcct cccgaagtgc tgagattaca ggcatgagcc accgcatccg 420 

gccatgcagc ttctcttttc tagagttaac aggagatgcg gcaggtctgt agtagcccag 4 80 

aaattctcag acttgcactc ttgaatttac acgcatgtac aaaatgcagc tcagcctaat 54 0 

gcctcctgtt cagttctgtg gtgctgccta cttcctgtgc ccaaattagc cttgcgttta 600 

taactaggag gaatgacttg gtgtcctaca atattttggc tcctgggctt ctttggtgtt 660 

cattctatta ctagttgatt ttttttttct tggaaagaaa agtattcaaa gagccagaat 720 

ttactttcat tatatcttac agttgttaaa ttaacctatc cttgttttca ctttctgtgt 780 

acttctttct ttggggtaca aaacagtgtt tctttagatt gtctatttct aaatattatt 840 

ttaaccagaa cataccaaat atattttcct gccagtttca ttcttcattt tacttcttaa 900 

ccattgttac gatttttttt ttaacttcgg tctcagattt tgctcaattg aagttgtccc 96 0 

agttcatgag attttgtttt ctttgcagct cttgaccatg acagatgtga ccagcacaca 1020 

gattgttaca gctgcacagc caacaccaat gactgccact ggtgcaatga ccattgtgtc 1080 

cccaggaacc acagctgctc agaaggccag gtcagaggct gtttcttaaa gatttcagaa 114 0 

aaatcccaat ttgtcatagg tttagctttt atagtgtata tggtataaat aatggcccag 12 00 

agttactttt caaatgggtt tctatttgga ttttattatc cctgaggttt tcctttagga 1260 

agagatgttc tgtatatttc aagggtccta ctagccctgg gaatttgtgt attgtgtttt 1320 

agaagaaggt aggactgttg tccctgggca tggctgtgac taagcactgg atccttggtt 13 SO 

tgaggcatct gatagtgacc tcactttacc agtaccaggt tttgataaag ttagggtttg 144 0 

agagtgagtc aggtgtacag cggcccatat tgaactgtgg aaatgacaga ctttgcaaaa 1500 

tctccttgtt ttatatcttg gtttggcata atctcactgc tcatcatata tagattttta 156 0 

aatttaagtt ataagatgac ccaggctttt atagtttttg acagcttaca agactttttt 1620 

ttttgtcagt ctcatacagt tcataaaata gaaaactttc agacttttga agtgagcatt 1680 

tgaaaagcac caagttcaac attcccattt tacagatgag caggttgagg catggtgttt 1740 

aaaagagtgg gctgcttcct tgccagttaa aagcatgtcc ttgagcagaa cattccattg 18 0 0 

cagtgttccc cactgcagag ctactgcaac ttatctatct atctgtctgt ctgtctgtct 1860 

atctatctat ctatctatct atctatctat ctatctatct gtctatctat ttatttgcag 1920 

tttgcttcct caattagatt ttgttgtcta taatatcctg ttggtcctca ccataagttc 19 8 0 

tctctgtgga tacaaaagaa aagcttttgt ccactgtgtt ctagatctcc atttttaggt 2 04 0 

atgagaattg ccccaaggat aaccccatgt actactgtaa caagaagacc agctgcagga 2100 

gctgtgccct ggaccagaac tgccagtggg agccccggaa tcaggagtgc attgccctgc 2160 

ccggtaggcc ttgcagggtc atcttggtgt gtgtgggtcc attacttcag cctgcttccc 2 22 0 

ccaacactgt gcagcctaag ttgaacctag cagaggggaa gagctaattc tgtccattca 2280 

tcccccacac gagtattatg ggcttttttg tttttaacta aaatacagtt cttaagtatt 2 34 0 

tgttcctact gtcctttgaa ataaagtgaa acatcctttg ctgctctgta gaattgagtg 24 0 0 

acagcttggt tctgtatcac tgagcctgct ccttgtctct ttcccatcca cctttgataa 246 0 

cctgggacta gaccatgatg tctcagcagt cagtctgctg agcactttat ggagagtact 252 0 

tcttattaac cactgggatt taattgttgg cacctgctaa tgggccttct ctgagaagga 2 58 0 

gaggatagat acttctgtca gcagcacctt ttaggggtga tctccagccc tgaaaacctc 2 640 

aatatatcct gcttctgagg ttcaggatga tgaactcagg gcctgagacc agccagccat 27 0 0 

gtgatatatt tggaccaggg tggtccagaa aggggaactt ccttgtctgt gcacaacatc 2 760 

tgagcttgtt cagggaagtt ggtttggacc aggccctttt tgaatgtccc ttggaagttt 2 82 0 

ttgattcagc ttgcagagtg ggactctttt ctatatctca gtgtgtctca aattttaagg 2 880 

tacaggaaaa gacctgggaa tcttgcgaaa ggcatattct ggtgcaagtg tggggtgggg 2 94 0 

cctagagtgt gcattttatc tagtcgactg ctacactgtg aggagcaagt ctttgtctgt 3 0 00 

tcttaaatga tctctttccc atggtacctt tcttttatct cagtgactgt tactgttaat 3 060 

gaacattgtt gatgtctcca aagtactttg gttctggtga agttgctttg ttctttcatt 3120 

gtcttccagg aagcattcat agcttctgtg cagtaccttg tgtgggttca ggatgatcac 3180 

aggtagcaga ttacaagctt gtcttgtatg ctatagccat atcacttggg ttgtttctca 3240 

agaaggacct tctcaccttg cttttgggat gctttgtaca cttgattgta ccttccacct 3 3 00 

gatgatatga aaacagtgca gcttttggag actatagatt tgttaattcc ttgattcatt 33 60 

tccattcttg cagtttttac cccagccctc caatatgcat attcatttgt ctgctcttca 3420 

cttaggattt tagttttcta attgttcttc agaaggaagt gtaccagtct aatattggca 34 80 

ccaaactggt gttttcatct aagacatagg ataagtgacc tcagaatatg ctttttagga 3 540 

tccgggagat atcaccagta aacattttaa aattcttgta ttctgcattt ggtccttaat 3 600 

aatgtgtcag aggctcccac atcctaatga agtacctaga atttaaatta gaaaggccat 3 6 60 

ttcggtattc agtaatttga actcataata cagtagtttt gtctgatttc taaaattctt 3 72 0 

ttctttctct tttcccctta atgaaagaaa atatctgtgg cattggctgg catttggttg 3 780 

gaaactcatg tttgaaaatt actactgcca aggagaatta tgacaatgct aaattgttct 3840 
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gtaggaacca caatgccctt ttggcttctc ttacaaccca gaagaaggta gaatttgtcc 3900 

ttaagcagct gcgaataatg cagtcatctc agagcatggt gagttaaaat cctcaaaact 3960 

taagtttctg gttatccacc tttctaccaa gggcatgact gcagcttgca tgtggaaggc 4 02 0 

tgtggatatg tgtaacgtgc ttggcaagaa ggggagtgct ggtgaacgca gcctgaggga 4080 

ctgtgggttt gtgctgtcag agtctcttcc tcttaaaatt tttaatactt tgtatatata 4140 

agate tatga ataattatat gggggatgaa ttgtaacatg tatatgtgta cataatctgg 42 0 0 

tgacatcagt agattatttc atacctgttt tacctctgga ttctgetagg ggagaaagag 4260 

aggtcactga taattagcta ggttggatta agccacctga gttccttgga gttaaggtat 4320 

tataatagtg cataagactg tataattacc actaagaagt gtacatctca gctggatgtg 43 80 

gtggctcaca cctgtaatcc cagcactttg ggaggctgag gtgggtggat cgcctgaggt 444 0 

caggagttca agaccagcct gaccaatata gtaaaacccc gtctctacta aaaatacaaa 4500 

aattagccag gcatggtggt gtgcgcctgt aatcccagct actcaggagg ccgaggcagg 4560 

agaattgett gaacccagga ggtggaggtg gcagtgagct gagattgege caccgcactc 4 62 0 

cagcctgggt gaeagagega tactccgtct caaaacaaaa aaagaacagc aaaaaaaaga 468 0 

aatgtacatc tccttgggtc tttcatagee cagccatctc aaaagaagag agcaccttct 4 74 0 

tgtcaagagt ttctaagccc agaaaggctc aagttctctg cttgtccacc cagtgctctc 4 8 00 

agggggctta tagtcaatat tccatgatca cattttgtca tttttagtct ggagtcataa 4 860 

attgtgatcc tacegtcagt taagtagact catagaacaa agctctttca gcagtttcag 4 92 0 

ctgtggtaca gaaacgttta gtggaaatgt tcttaccaag cagggaggag ttgagggcaa 4 980 

cacttccttg ggtacagect ccttcatgtg aaggtatgga aatgtggcct gggtctegge 504 0 

tgcctgtggc ctctgtgtac cacctatagg gecattctga gactggtagg aggtgccctg 5100 

tatttagttt tctccaatta gtcccttttt teagtgeaac ttagatgggg tatggacact 516 0 

caaacattgg tgacatattc ttagtgtgtt tacctcaggc tactgtgacc acatttggta 5220 

tttcataata ttttgatagc tttttcagat ttcagaatct ccattggtga ctgtctctgt 52 80 

tgtttctctt tccatgtcca aatgtgggtc tcttccagca ttccatcctt ggctggcagc 534 0 

tgacctttcc catagttatt cactccctca gaaaatggat ggcacccagc tttgettatt 54 0 0 

accctggtgt ctctaacaat gactctcgag tecaeggaag ttaaaagggt tcaaccaggt 54 6 0 

ggecacagat atacttctgg taccctttct ctcttcctta ggttttctaa etctaaaegt 5520 

ttctgggtat tetaatctge tgtggccacg tttatgaaac agaaattcac agtcttaatg 558 0 

ataagactga cagatgagag acaactgaag tgtaatgtcc ttccacagct atacctctag 5 64 0 

atgtagccca gttagtagag cccagatttt tatggaaaaa caagaaagga cacctagcct 5700 

aacccttagg acagaggctg gctggtagaa agctggaagg aggtgacccc tgcaggtgag 5 760 

aaggagtgaa ttaggatgtc gaagacggaa gggctttctg tgatttaatt agtgccccca 5 82 0 

tctgtgagat gtagagggag atgattaagg gagtggctct ttgagtgagc tgcaggtaag 588 0 

tttgcatggt tggctgeagg ctggatggga ggggattttt a tag tt gage ctcaggaagg 5940 

aaccaaggcc agaccctgcg aggecatgge tacaatagta atggatttga attgtatcgt 600 0 

gaaggcaaag aaattattga agggecttta aaaagtattt ttaattgtct ttctttcttt 6 06 0 

tggatctgtt tatttttagg tcttttagta ggcatgtgta ttttttcctc tcaaaaatgg 612 0 

aaataggctg ggtgtggtgg ctcctgcctg taatcccagc attttgggag gecagtgagg 618 0 

gaggattget tgageccagg agttcaagac tagectggge aacacaggga gacctcgtct 6240 

ctacaaagga agttttttta aagaattagc egggcatgat ggtggcacat acttgtagtt 6300 

ccggctactt gggaggctga ggcaagaggg ttgettgage ccaggggttt gaggctgtag 636 0 

tgagccatga tcatgtccct gcactccagc atgggcaaca gagegagace ctgtctcaaa 642 0 

agaaagcaaa gggagggaaa tacagtatat cttttgtttt ataactacca aaattaggaa 64 8 0 

tacttaccat ttcttggcta aactttatat tttgattttt aaaacttgtt aaaaattgea 6540 

atgagaagga aatttcagga gagcagaaga cagactgtcc caggtgtcac tgtcctatta 6600 

ttccctataa aatccagtgc caggatggat gaatggataa agcaaatgtg gtataagttg 666 0 

aatatccctt atctaaaatg ctttggacga ggagtgtttc gaattttaga atatttgeat 672 0 

tatactaacc agttaagcat ccctaatctg gaaatccaaa atgctctagt gaacattttc 6780 

cttcagcatc catcatgttg gcactcaaaa agttttagat tttggagtgt tttggatttc 6840 

agattaggga tactcagcct gtgtttgggg gtagecatet cttcatatag acatttcaga 6900 

acttaaatat tgetttgeta taatttctgt gaatttttga tatattatct tctctgagct 6960 

acatttttat cctttataaa atggccatat tgaagtgatg atctatccta atctaccatg 7020 

gctgagtcaa gggataaaga ggttttcctg tgtctgtggg gtatacttaa cttggtggtt 7080 

tttatctaga agcttgtttt ggtcaagatg ttggttatat teaggecagg catggtggct 7140 

catgectata atctcaacat tttgggaggc caaggtggga ggatcacttg agctcaggag 7200 

tttgaaacta gecagggcaa catggcaaga ctccatctcc aaaatttaaa ttaaaaaaag 7260 

atactatctg tattcatagt tgtgtctctt ttgectttag tccaagctca ccttaacccc 7320 
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atgggtcggc cttcggaaga tcaatgtgtc ctactggtgc tgggaagata tgtccccatt 73 80 

tacaaatagt ttactacagt ggatgccgtc tgagcccagt gatgctggat tctgtggaat 7440 

tttatcagaa cccagtactc ggggactgaa ggctgcaacc tgcatcaacc cactcaatgg 750 0 

tagtgtctgt gaaaggcctg gtaagttcac aggtgaatta ggtggtattc agagtttatt 7560 

gtgagagaaa ccataggagg catagttcat tgctgagatg tgtgaagtag tcatgaaaac 762 0 

agatgaagta ttgatttcaa gcatgcaaag aagagtataa cccagatttc agaagcagaa 768 0 

ggaaatattc tgggaccctg aatagtttta attataagca aaactaaaaa taactaacac 774 0 

tactcgaaga aactgatatt ctaattaaca atgagattga taggtttatt gaccagaaaa 78 0 0 

agtattgaga attgctctga aaagcaaatt tattggtgtt ggcagagaaa tgctgtggaa 7860 

gaagaaacaa aaaaggaaaa taaaaccaaa gaaaagatta gtaaagcaag caagtgactg 792 0 

cagggacagt gttcagaaag gtagtgtcaa cagggagaaa aatgatgaga gcagttctgt 798 0 

aagcgaggac agaagcaacc cagaactacg cgagagctgc aggagagtac tgagcagaca 804 0 

gacactcgga gtagcttccc agctttcagt ctctctgggc tacattttgg ctaactaaag 810 0 

gcaggcccag gggtagctgc tgcagcatga agacaactca aaataagcct cctcctcctg 816 0 

cagagggact cacaagcagt cggtggaatt gtgggttatt ttgggaggct ggactccatt 822 0 

tgcattgtgt ccaattttgg gaaagtaatt tgtctgagga attacagagg tataggagag 82 80 

aattcatagc attgtagatt tctaaatatt gatttctaaa cttctctagg ggagctgccc 834 0 

agcagccttt cagaattctg aatcctttgt tgaacttaat gaatgccgta gaccttcttc 840 0 

cctccaaaat tgcaaacatg agattttaca tattcaagtg gattgtaaaa cccctaaagt 846 0 

tcatcctggg tccccaaatt ctaaggggct tacagcccta cttttatgga aagattcact 852 0 

gtttcctcat ctttgtcctt aatgctcttt gaaaagaaaa tttatttttt ccacaagtgt 8580 

gtaaatacac ttgactaaac agtacttgat gacttttatt gttttctatt ttctctcatt 864 0 

taattgattt catggcacat taatgggtca tcacccacat ttgaaaagtc ttggctgggc 870 0 

acagtggctc acgcctgtaa tcccagcact ttaggaggcc aagacgggtg gatcacaagg 87 60 

tcaagatatc aagaccatcc tggctaacat agtgaaaccc catctctact aaaaatacaa 8820 

aaaattagcc agacgtgatg gcgcacccct gtagtcccag ctacttggga ggctgaggca 888 0 

ggagaatcac ttgaaccggg gaggcggagg ttgcagtgag ccgaaattgt gccactgcac 894 0 

tctagcctgg agacagagtg agactccgtc tcaaaaaaaa aaaagaaaag aaagaaagaa 90 0 0 

aagtctttga ccttagcgga catggtggga gctctaagtg tctctcttgg gtttcattcc 90 60 

cagcaaacca cagtgctaag cagtgccgga caccatgtgc cttgaggaca gcatgtggag 912 0 

attgcaccag cggcagctct gagtgcatgt ggtgcagcaa catgaagcag tgtgtggact 9180 

ccaatgccta cgtggcctcc ttcccttttg gccagtgtat ggaatggtat acgatgagca 924 0 

cctgcccccg taagtgaaaa agggagccct aggcacttat gcatgccctc tgtataggca 930 0 

acaactcagc catgaggctg tgctgtcagc ctctgaacat tttagaaaca agactggaca 936 0 

tgacctctgc tcaaacctga ccagagactg ccatcgagac cttgctgcct attgagaacc 942 0 

ttcatacaga atcaggcaca ttgacagtaa ataaatgtaa gatagatcac agagtacaga 948 0 

aataacttgt ccaacttcag tgtcatattg ctcaatccat gtaatatctc catatctgaa 954 0 

tttcctaatt tgtaaagtaa atgctttcca atagataatc tctaaggtcc cttttgcctt 96 0 0 

caacatcctg ggattgagag aggagggaag ggtcatctct gttatgtatt gggcaaaata 9660 

ctgggctctt tacattcatt atctctttta aataatcaag acagaataat atttttgact 972 0 

caagccagtt gaatagtctg ttaaaaaaaa agtaaataca gtgaattcag atctacctgt 9780 

gatagtcaat tgcaactttt tttttttaat agctgaaaat tgttcaggct actgtacctg 9840 

tagtcattgc ttggagcaac caggctgtgg ctggtgtact gatcccagca atactggcaa 990 0 

agggaaatgc atagagggtt cctataaagg accagtgaag atgccttcgc aagcccctac 996 0 

aggaaatttc tatccacagc ccctgctcaa ttccagcatg tgtctagagg acagcagata 1002 0 

caactggtct ttcattcact gtccaggtaa gatgccttgc atatccaaat tcaagtgttt 10080 

cactactgat ttatgaagaa taaaaccttg aaagctacgt tgtgtatatg taactccctg 10140 

ccctcagccc ctttccttcc tcctaatggt tggtacaaga aggaatagac cagaagctgg 102 0 0 

tccaaggcct gacctggacc tgctgagagt ggtggtgggt tcctaagaaa ccaattctaa 10260 

gaaattggcc tttgattcag acttgaagtg accactcagc aatgtgtctg tgggtttcta 1032 0 

gaacagttgg gagaggctgg gctggtgcaa agactcctca gagattagca gtcaagaact 103 80 

tctctaagag cctgccattg acaacagggc tgtttgtgag gactttgtaa gggaaagtcc 10440 

actgtaaaca aagctaaaag ggcagagaca gactgggaga aaatacctgc actgcatgta 10500 

acaactgatg atcatccaga atatgtaaac tcccaccctc cagggaaaaa ataagaagct 1056 0 

aaatgtgaac ccaacagaaa aagtggttat tggagataaa gaagccattc tcagaaaagg 1062 0 

aaatagaaca ggaaaatgta aactaacacc aggaaccaat tttttgtcta ctaaactgga 10680 

tcaaattttc cgtgttccct tttttccata ctaagatatg ggggacctgc aattctattt 10740 

ttaatctgct aggagttaac tttttaacga aatatttaaa tctctgcttt ttcatgaata 10800 
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tcacgaatat atctggtaaa atgacaaccc agagaatggg agaaaatatt tgcaaagtat 10860 

atatctaata agaatccaat gtccagaaca cgtaactctt aaaactcaac aatagaaaga 10920 

caacccaatt aagaaatgga taaaggattt aaatagacat atgtccagag aagaaataca 10980 

aatggccaat aagcacatga aaagatactc aatatcattc atcattacca agaaatgcaa 11040 

gtcaaagcca caatgagata ccactttaca cccactgaga tggctgtaat caataaaaca 11100 

ggtaataaca agtattggaa ggatatgtag aaattggaac tctcatgcag gttggcaggt 11160 

cctcaaaaag ttaaatatag agttatcata cggcccagca gttttactcc taggtacata 11220 

cccaagaaaa ttgaaaacat atgttaccca aaaacttgta tataaatgct tatgcttata 1128 0 

gcatcattct tcatagcatc atgctcaaag caacattatt cataataagc aaaagtgaaa 1134 0 

acaagccaaa tacctgttag ctggcaaatg gatgaacaaa gtgttgtata ttcatacagt 114 00 

gtaatgttat ttggcaataa aaaggaatga agtactgaca tattttctga catggattac 1146 0 

cctaaaaaaa catgctatgt aaaagaagcc aggtgcaaaa gattatgcgt tgcatgatgc 1152 0 

catttatatg aaatgtccaa agacagaaag tagatgttta gtggtttcct agggctgggc 1158 0 

atggtaatga agaataatag gcatgaggtt ttctagagta gctgcagcat tattttctga 1164 0 

catggattac cctaaaaaaa atgctatgtg aaagaagcca ggtacaaaag attatgcatt 11700 

gcatgatgcc atttatatga aatgtccaaa ggcagaaagt agatgtttag tggtttccta 11760 

gggctgggga tggaaacaag gaataatagg catgaggttt tctagagtag ctgcagtatt 1182 0 

attttctgac atgaattacc ctaaaaaaac atgctatgta aaagaagcca ggtacaaaag 118 8 0 

attatgcatt gcatgatgcc atttatatga aatgtccaaa ggcagaaagt agatgtttag 1194 0 

tggtttccta gggctgggga tggaaacaag gaataatagg catgagattt tgtagagtag 12 000 

ctgcaccatt ttatgctctc accaacatca tatgaaggtt catatctttc cacatccttg 12060 

ccaatacttg ttactatctt tttaatgaaa actgttctag tggatgagaa atggcatctc 1212 0 

actgttgttt tcatttgtat tttcctgata actaatgaaa ttgaacatca acttcattag 12180 

ttagcctttt gtatatcttc tttggagaaa tatttagtca aattctttgc ccatttttca 1224 0 

gttatgttgt cttttttatt attgagttgt aagagttctt tatgaattct gaagtccctt 123 00 

attggatata ttatttgcaa atactttctc ccattatatg ggtttctttt cactttctta 12360 

atatgccttt tgaaatcaaa agttttcagt tttgattaag ttccatgtat cagtgtttta 1242 0 

ttttatccca tgtgcttttt ggtattgtat ctagaaaatc agtgcccaag taacccagga 124 8 0 

tcacatagat ttgctcctaa gttttctttg aaaagtttca tagatttgtg tgttacatta 1254 0 

ggtccttgat ccattttgag ttaatttttg tgtatggtgt gaggtagggg tccaaactct 12600 

ctttttgcct gtggatgtgg tcctgcacca tttgttgaaa agattttttt ttttttttaa 12660 

ccattgaatt ttcatggcac atttgttgca gatcagttga ctgtgaatct aggaactttt 1272 0 

ttctaagctc tcatttttgt gcagttgacc aatgtttctc cttttgccaa tattacactg 12780 

ttttgattac tgtggctttg tataagtttt aaaattggga aggctaagtc ctccaccttt 12 84 0 

gttcttattt tgcaagactg ttacagctat tctgggtttc ttgcatgttc atattaattt 12 9 00 

taggatgagc ttgccatttt ctgcaacaac aaaaagccaa ctggtttgat aaggtttgca 12960 

ttgaatctac tgaccaattt ggggtgtatt gcctgtttgt ttgtttgttt gtttgtttgt 13 02 0 

ttgtttgttg agagagggtc tcactctgtc acccaggctg gagcgcagtg gtgctatctt 13 080 

ggctcactgc agcctccgct ttcccaggct caagtgattc tccagcctca gcctctcgag 1314 0 

taactggtac tacaggcatg agctaccaac acccagctaa tttttatatt tttttagaga 132 00 

cagggtttcg ccatgttgcc caggctggtc ttgaactcct gagctcaaag caatcctcct 132 60 

ggcctttgcc tccctagttc tgggattaca ggtgtgagcc accacgcctg ggccccttgc 1332 0 

cattttaaca agggttaatc ttctagcctg tgaacctgga atgtcaattt attaccctct 13380 

ttactttctt ttggcattgt tttatagttt tcagtgtaca tcattgtctt gttccttttt * 13440 

ttaggggaaa ataattcagt ctttattaaa tatgatattt gtttagtttt ttttatacat 13500 

gccctttatt gggttgagaa agtttccttc tattctagtt ttttgagggt tttgtaatga 13560 

atgagtgcta gattttgtca aacacttttt ctgtgtctac ttagatgatc acgaagtttt 13620 

gttctttatt aatactatgt attacattaa ttaagaatgt taaaccagtt ttgcattcct 13680 

gggataaagc ccatttagtc atgatgtata cttttttatc atgatgtata tttttaatct 13740 

gctgttggat tctatttgct agttggggat tttgtgtgta cattcatgag ggggatataa 13 8 00 

gtttgtggtt ttcttctttt gtgattacta tctggttttc atagcagggt agtactggcc 13 860 

tcctagagta agttggaacg tattcctcct cctctattta ctggaagagt tcgtcaagga 1392 0 

ttagcattaa ttcttcttta gatgttgatt gaatt caeca gtgaaaccaa atgggcctgg 1398 0 

cctttttctt tacaggaaaa tttttaatta ttaattcaat ctgtttgtta tagatctatt 14040 

cagattttct ttttctgagt gagtcagttt tggtaatctg tttttctagg aatttgtcta 14100 

ttctcagtaa tcaaatttgt taacatacag ttegtagtat tttcttttgt tttttttttt 14160 

ttttttttga gacagagtct cactctttca cccagtctgg agtgcagtga cgtgatctct 1422 0 

gettactgea acctccgcct ccagggttca agtgattctc ctgcctcagc cccccaagta 14280 
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gctgggacta caggcgcacg ccaccactcc tagctaattt tttattttgt tttgttttgc 14340 

tttttttagt agagacaggg ttttaccatg tagaccagga tggtctcgat ctcctgacct 144 00 

cgtgatctgc ctgccttggc ctcccaaagt gctgggatta caggcgtgag ccaccacgcc 1446 0 

cggccagttc atagtatttt cttgcaggtt gctctttagc aatatgtagt ctcctgttta 14 52 0 

acctgttaac gatttttttt tctgtctaat ttggttttat tattctttta cattttgcaa 1458 0 

gggctctaaa tattgtgagg gttttttttt ttaagagctt gctctcttat attgtggatg 1464 0 

caataattta agccttattg aatgtactaa aattcgtatt tttcattctc tcatatttct 14700 

tgcattaacc cttccttcag gatttgttgc tgtgcgtgtt tatcctggtg gtattcggag 1476 0 

attgagagtt ttcttatact ggttaatcct caaggttaat ttgtattaat aactaaatgt 14 82 0 

gtagatctgt caatattggt cagtggttgg gtgtgctgag ctgttgtgaa agtgtggtat 14 88 0 

gatgtcctgg aaaagctagc aagccagcct caggacactc cagctctggt ccacttcgac 14 940 

tgttgttgac tggtgtgcgc atctccctca ttggaagaag gaaggagcaa tggacctggg 15000 

tggctgggtg caattagcat gttcatgggt gtggcaagtg cctcccagcc ttgggtggta 15060 

gggccaccta aagatagctg gcacattgcc ttgagttttc tttcattctt gcttgcttgc 15120 

ttgcttttat ttttgagatg gagtttcact cttgttgccc aggctggagt gcaatggtgc 1518 0 

tttcttggct cactgcaacc tctgcctccc gggttcaagc ggttctcctg cctcagcctc 1524 0 

ccaagtagct gggattacag gcacctgcca ccacctctgg ctaatttttt gtatttttag 153 00 

tagagacagg gtttcactgt gttggccagg ctgatctcga actcctgacc tcaggtgatc 15360 

cacctacctt ggcctcccaa agtgatggga ttacaggcat gagccaccac acccagctga 1542 0 

gttttctttc aaacactcaa atactaacag gtgcataaac agaacaagta caaggctatg 154 80 

gaaggttgcc aagatggtga aaaactgcat cacactgatc aaccaaggca gatctgaaga 15540 

atcagatttt gcttacagtg caggttgtgc actatgtaaa tgtttccccc atcctattcc 15600 

Gccctagagt tctgcagtgc acagcctgtg cagctgtcct gagcagtcct cagaggtcca 15 660 

agcttcatat acctttctag ttgtaaagcc ctatatccat taggagtctt agtaatccca 15720 

cttgtggaaa tgtttatccg taaagatggt tatttttcta ataatgaaaa atgtaaagca 15780 

aactaaatat tagaatataa gggaatggtt aaataactgt taagattcat caaaatcatt 1584 0 

ctaccatttg aagttttgtt tataattgat gaaataaggt ggtaaaatag ttgtgatcta 15900 

agttaagaaa ggtgaatatt aatatgaaca taattgtgca aattatgttc ttctggtata 15960 

gaaaaatacc agaagaaaat atatcaaatg tgaacagtgg tctctgagtg gttggattgt 16020 

atatgatctt tccccatcct tctatttttt aagtatactg tcatgagcat acgttatttt 16080 

ttagtgaata acaacaaaaa atatttttta atatcactat gggctttcac cttgctgtga 1614 0 

tacattttca attttcctgg actagctctt ctttgatatt ttttatctta tttctcaaat 16200 

tagtttttag tgggccagtt ttggtgattt atcaaagata aattaactac agagatagtt 16260 

gcagataaat cttattgaac ttaactacat gcagttcttc catggtacag accccttgaa 1632 0 

actctttttc ttgcagcttg ccaatgcaac ggccacagta aatgcatcaa tcagagcatc 163 80 

tgtgagaagt gtgagaacct gaccacaggc aagcactgcg agacctgcat atctggcttc 16440 

tacggtgatc ccaccaatgg agggaaatgt cagcgtaagt caaattggtc aggtttactc 16500 

atggcaaatc ggtgtggaac acagcacttc tatttgactt gaatatctag gaggaaaaaa 16560 

gccactttgt tttggatacc acatttctta ttaaatcata ctggtcagaa gctcagctga 1662 0 

gctggaagaa agaaccaagg tttcagtgta gctggttaaa gcaacaagcc aaaaatgtta 1668 0 

gagtcaagcc tttgaccaca ttctgcagtg gtataagcaa gggcccaaaa ccagagaaag 1674 0 

tgccaacttc ccctgtttcc tttcctccca ttgaacaggg cacaggtcaa tagtcagagg 168 00 

gat cage tea gacatgggag tcttggggct catctcactg aggcccggac aaaaggttcc 16860 

agcagtatat ggacactaag gttgtgggaa gggcaaggtg taagggctag gactgggaaa 16920 

gtcctgggtt ggagtcagag ttaggaaagt gtcttctgaa ttccccctcc tccccccatc 16980 

aggaggcett ggtgcctgaa gaagacctca gcccaaggcc tcagataggg agcctgggaa 1704 0 

taaaggctcc agggctagga atgtgaacat gtgaaggagc gtgacccgtg aggtagacct 17100 

gagtccttaa ctgcacctct ctcagagccc cacacttagg cccacctcac aggtcatget 17160 

ctgccaggta aagcctgtgg cattgectgt ggtcaggccc tgaaaaatca cacacgggct 1722 0 

ttttaaccag gagecagact cctcaatact tatagtgtat tttaaaggtt taaatagtcc 1728 0 

cgtgttggct gggtgtagtg gctcacgcct gtaatcccag cactttggga gggecaagge 1734 0 

aggtggatca cttgaggtta aggagtttga gaccagcctg accaacatgg tgaaacccca 174 00 

tctctactaa aaatagaaaa attagctggg catggtggcg ggggcctgta atccaagcca 17460 

ctcaggaggc tgaggcatga gaattgcttg aacctgggag gcggaggttg cagtgagcta 17520 

agatcacgcc actgcactcc agectgggea acagagtgag actcagtctc aaaaaaaaaa 1758 0 

aaaaagaaaa aaaaaaaagt cctgtgtttc cccatttatg tcaaatagaa gcctgcaagc 1764 0 

aataggacat tttattatga gaacaaaaat tatgacaagt gatattaatg aactgetttt 17700 

tttattctag tttccagcca aaaaacataa tgagtactat agtcgaaaga cttttcaaag 17760 
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ttctgagcag gaaaagtaaa caacataggg 
ccaaactctt tggcatggcc cttcagaatc 
gcagcgttct agctgtgtgt cccttactct 
ggtggttgtt aagatgaaag cagagcagca 
atacttagta gatgtgggca attcttatta 
agtaaaagtt cttgctcaga ttaaaaggta 
gaatagactt aatccgttag acagtaacca 
ggcattgtgc tgttacccac tcaagacaca 
actgcccagg gtctcacagt gagatgtggc 
accttaatta cttggctggt tgtgcttcca 
atctagaagc atcacatttt cctctgtgag 
tttttcagag tcctaatcca cggtagaaag 
ccaagtcttt gatgttttct tcttaaaatt 
gatttcttga atgcttactg tgtttgagac 
tgagataggt actgttgtta tccccatcat 
taagtagcat tcccagagtg acagaagagc 
cagatggacg taaggccttt gcttttcact 
agagccaagt gtcagacact catgattctg 
ctattctgtt ttatagtttt ctttgggtga 
ctctccagct gaatgccttt catttattca 
gttgaccaaa aaggctttca cttttttcaa 
tacgttggat gataaaagga gagatgttga 
attttctgta aaggcaaagg aaaaggtgaa 
agcgctgacc ttggcattgg atcctggttc 
tcgtggagaa ggcacctcag ccaagaccag 
cacctcccta ggggcagcag ctatgtctgt 
gatgcttcac aaatatttat tgaatgaatc 
attgaaaaga tcaaaatagt cattgcttca 
tcctgtcctt tgtttggttc ctttaatgtt 
tcatgaatct tacccattga atgctcgtac 
cagataaaat tgcttgtttt ggaaaaagtc 
tgacctctgt tcctttttct tctctgcagc 
caacaccaac acgggcaagt gcttctgcac 
gctgtgagta ccatactccc tggaccacca 
aaccccaccc tgagagccac ccacttccct 
cctgggttga ttagtttgaa attttgccca 
gaggaggtcg tcgagttagc agcaattcct 
aatgatactg acatttcctg ggttgtccat 
cttgattttt gtcatgtcaa gagattatac 
ggtgttttgg tgatggcatt tatttggcgt 
tacctttaac atttcccatt actctagcat 
ctggagaggg tttgaagtcc ccatccatga 
taaggtttca gacctggccc tgcctccact 
tccccccatc tgtcagtagt gggactagca 
aacaggaagg atagttcctc aatcaatgta 
aaatgcatgt atccgcattt actcattctc 
aaacactgag ctgattaatc attgtaaaca 
tgccaccagt gattgagaca tcattgctgg 
ttttagttta tatctgcata tatgtttact 
aagaaaagtc taatccaagg ttaaatttaa 
aatgtcatcg agcaaaatta atcactctta 
aggtggcatc aagtaagact ttcaagttcc 
tcatcaccct ggctaaaggg acacagcata 
cgccctttcc tcttcatgca tagctgttgt 
ctgggttttg ctgatgttta tagatgatct 
tatttttttt ctacagactg gtgaacactt 
ggtatttagt cttctctcac ttgtgtttaa 
ggaaagaagt gagaaattag tatttgaaaa 
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aaatctctac ctactcccca gatcccaact 1782 0 

tgatctgatc caagaccagc caggagagag 1788 0 

ccctggggtt cagtttcctc ctaataatag 1794 0 

catgacacat gttccacagg atactagtca 180 0 0 

cccttctaat gccattctcc actactccta 1806 0 

tttggtgaaa atgttttccc cactctttgt 1812 0 

gagtacctaa cagagtggtg gagtgtacaa 18180 

ggtgctttta ttatccccaa gatcaagtaa 1824 0 

caagctaaga accacgccca gtctgtgctg 183 00 

gatgccatca tacccacatc agcagctgct 183 60 

atctacaggg cctgagtaca atttgcctta 1842 0 

cgtggtgttt atagaatact gcagatggca 184 80 

gtatagtcct taggtaacga tagtagccat 1854 0 

atcatataca tatcatcaca aacaccccta 18 6 00 

atggacaagg aagctgaaac tcagaaaggt 18660 

caagtagagg acgctgaatt caaacccagg 1872 0 

atacattaca caccttcccc cagtctcaac 18780 

acttcagcat ccattctgtt aatctctgca 18840 

ttccaaagga tttattttta aagcatacct 189 00 

ccagcaaaac agggtataaa gtgaaaaggt 18 9 60 

ctggagtata atatttatca cggcttgtat 19020 

gttggccaga tgagggagat gggaagagct 19 0 80 

gcaagggtgg catgtcctga gggcccctaa 1914 0 

tgagccctgc ggtgattcca aatgtgacca 192 00 

ccctccttca gaggacagtc acctccagag 192 60 

gtccccacac ccagcacctg gcattctaga 19320 

aaagaatcat agcagtgaaa aagagagtct 193 80 

gaaggcagtg ggtaccagca ttcactttcc 19440 

acctatacat tattatcatt agctgaaaaa 195 00 

tttaatctgt ctttcctact gaactcaatt 195 60 

tccaatagtc agaatttttt aagccagttg 19620 

atgcaagtgc aatgggcacg cgtctctgtg 1968 0 

caccaagggc gtcaaggggg acgagtgcca 19740 

gggaggacca agaggctgtg cagctgcctg 198 0 0 

gtgtcttgtt gctgtgggct ctgagggatc 198 60 

ttctatttca gacaggtcag tccccaaaat 19920 

taatggctct tgaattcaca tttttgttta 199 80 

ttggagtagt cattttaact tcagcaacta 20040 

tctttcccaa agagtagaga tggaagagct 2010 0 

ttggttgtct gattgtggaa atgatcctct 2016 0 

tttccttgtt gaagcatgtc aggcatcttc 2 0220 

ggatgcaggt gcagcagcat caggcatgtt 2 0280 

gagcaaggtg ctcttgaggc aaatcactgt 20340 

gtaattgttt ttggtgtcct ggtaggttga 20400 

taccgggctg tctcactggg aaacctcata 20460 

aagaaaaaac ttaaaaagtg tttagtgtca 20520 

tcattatttc ttaaataaca gtagtaataa 20580 

tgctactcag cagggtttgg tctgtctact 20640 

agcttcatgt tgccctaaag ccaatatgtg 20700 

cttttaggtc cacattccct taaacagaaa 20760 

ccctcagtta gctgatgcaa caaagacatg 20820 

ttcgagatcg cagatgtagg caggtgctgc 20880 

cctgcctggg aaaggccatt acccacttcc 20940 

gtttacctaa atgcttcatg aagtgggagg 210 00 

gtatggggaa attgttttct gagtagaaca 21060 

gttccagggt ttaaaaaaac agtgattctt 2112 0 

agaagagaat tagtttatcc agggaacata 21180 

atgttttgac tgcactttta gaagaatatg 21240 
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tagtccacac aaaattgaaa atgtttttca ttatagtaaa taggtttatt acatttggat 213 00 

ttctctcatt gctttggcaa atgcttgtca attgtcctca tatagttggt gtgtgaactg 213 60 

ctgagcagtc agtattgaag cgttcatgca tgctcttcct agactgcctg agctgagtta 2142 0 

tggtgaagga tgcaattata atggctccag actatctgta ctttatataa aaggatctgt 21480 

ttggttttaa aatgaattca gtttctgttt taaatagcag tataaaatag tcttttttcc 21540 

cccccagatg tgaggtagaa aatcgatacc aaggaaaccc tctcagagga acatgttatt 2160 0 

gtaagtggtt ttgcaattct tatttctaga agcaaagtag ttcagtaaaa cttcattgtt 21660 

taaacgggtt tgagaatagt aagtgctata agactatagc agccaccaat gaagtgttcc 21720 

cagacttgat atgtttacat tctgttaagt ttactacata taggagcact gttttaagct 21780 

gttttaattg tgtttggggt taacgttaat gtgtccatag cagatagcag gagagagtag 2184 0 

agaggcatgc atctttgtct atccacattt atgttctctt aaaactttac tttatttgcc 21900 

attacctagt tggggtcatc atatttcgtg ttttaggatg tagatcaaaa acagaattct 21960 

tacaatatgg ttgaactttt tgtcattctg tctggaactg atttgtgtta accaccttga 2202 0 

ggtgaaaggt gagtctgaca aggtgaccat gtttttatgg gtaaatgtgt tttctcttta 22080 

tgggagaatc cacatggtag accagagtac gggaccagaa aaaaggagtt aatgttatgg 22140 

catatccatt gtaattatat acctgctgta ctgggttgaa taacatccct tcaaaattca 222 0 0 

tgtccacttg gaacctgtga atatgacttc atatatatat atataaaact tctaaaagaa 22260 

aacaagagaa aaacatcttg accttgggtg gggaaaagtt gttagatagg attctaaaag 22320 

cataattcat aaataaaaag tagactaatg aaactatata aaaatttaaa acttgtagga 22380 

aaaaaccatc caaaacataa aaagccacag actagagaaa atactcgcac atcatcatat 22440 

atttgataag gagtttatat ccagaataca tgaggaactc ttacaactga ataagatgac 225 0 0 

aattccagtt aaaaacaggc aaagatttga aaagacattt catgaaagaa gatattacag 22560 

tggccaataa gcacattaaa aacgacttaa catcattcgt tatttggaat ggagatttta 2262 0 

aaaccacagt cagatacagc cctaaagtga ttagaatcca acgctaatgc catggctttt 22680 

tagaagacag tggtaatctc atgtctgctt ctgcattcag tctgttgcag tacatctttt 22 74 0 

tggttaaaca catgaaaaaa acctggcctt accaggcatg cagttggaaa agggtatagt 228 0 0 

gatacccttt taatagcctt ttcagataat tatggacgtt acttgatatt atgctgaaac 228 6 0 

tggacaactg gtcatttctt taaagcggtt tgatgggtta acccaaataa aatgatactg 22920 

cccacagact tgattttctt tgaggaacct gggtacagct gagttaaagt gctttcctcg 22 98 0 

ttacttaatt tataagaaaa gcagcctgta tctctaaaga cctgttctat tttgtgtgtg 23 04 0 

tttagttttt aaatatgcat ttctttcttt ccatagatac tcttcttatt gactatcagt 23100 

tcacctttag tctatcccag gaagatgatc gctattacac agctatcaat tttgtggcta 23160 

ctcctgacga agtaagattt tttaaagtct tcctattttg ttttgaattt gtatggatct 2322 0 

ttttcttggt cattacggat ggacgtactg ccttaacagt gctctccaga ctggagtaca 232 8 0 

cgagatgatc tctagaggta taggaaagaa atgttagact ctacgttatc tcctttccac 2334 0 

ataaaaggca aaagtgatgt taataagatt taccaggatc ttagacacag actgacattg 234 0 0 

attccacgca tacttactct gcctgtcagc ccatcatggc ctcatacaga aaggggactc 2346 0 

caccatcaga gggcagatag cagagcctgg gtttactctc tcaagagtga ccagaggctt 2352 0 

aaagacactg ctgattgaaa tgccagtgat gcagccccaa tcagacagca agggagggga 23580 

ccccaatcag acagcaaggg aggggacccc aatcagacag caagggaggg gacactctgc 23640 

tcctagagtg agttcttagc ctcattagga ggcaaaacag caaaggctta gttgggtcca 237 00 

ttaagaagtt agccaggaat aatttaaatt gttaaaatat gtatgtaaaa tgtggatttt 2376 0 

tttatctgct gtcattaacg atgaagccca acctgcctta aggtattacc tagtggtaga 2382 0 

aggaaggcca cactgcggga catttaaaac tgaaacatac agaacacgaa gatgcacctg 23 880 

tacagtttct tcaatgaaat ataaagtcat gcagtaccca cttcagtatt taaagaaatt 23940 

ttggtaaaca taatggtaaa ttatttagga acttccttgg agatttctta cttctcatga 24000 

acatacacaa agccatttac cattacaaaa ttccattaac aataaatgtg acaaataata 24 06 0 

catggaaaac aatatggcag taagacaaag tatattgctt tgttaacaaa tgttatctat 2412 0 

aaacattgct aaatttaatt ttaaaagtag aacaaagtct atagtgtggt atgtttacta 24180 

tacaagtaaa tgaaaatagg atttgttttt aatattctat ttcaaagata aaattaagaa 24240 

aaaatgtact agaaaagata cattctcaaa agtaaattgc tttttaagta aaaataataa 243 0 0 

attactttaa atgaattatt ttatggaaaa actataggta ttaatatatc atttgagtgg 24360 

ctgtattgac tggaattttt ttcttaacga ttttagaata agattttaac aaaagtacca 24420 

tatatgaaat gtattcactg cctcataagc aagcgtttga agtgctcaaa atctttcagg 24480 

atatacttca tgccattaat gtcattaaaa aaataaatat agtagaatct ttgtaatact 24540 

tcttaaccag gtgggaagca accatgaaag tatttggacc tttctggatt tcccagttta 2460 0 

tcttacgaca gaatacttac tagagttatc caaatgcata tgttctgtcc tctataaaag 24660 

cacaagcatt ttaagtttat tgattctttc tgtggaaaga catatagttg acccttgagc 2472 0 
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aatacaggtt aaaactgtga ggatccactt acacgtagac ttttttcaac caagcgtgga 24780 

atgaaaatac agtatttgca agatgtgaaa cctgagtata cagagggtgg acttttcata 24 84 0 

tgcaagggtt ctatgggcag actgcggact ggagtatgtg tggatttggg catgctcagg 24 900 

gggtcctgga accgatgacc cacgtatacc aaaggatgac tgtaattgtt agttgtgtgc 2496 0 

tgccagcaaa tactaaatac aaataagtaa acacttgagc tgtcctttca agatgaaggt 25 02 0 

gaggtcttat cagtaaatga gaaggtaaat gctttgtgag agaaacttct ggtttaatga 25 0 80 

tcacatttta aaaatagctg tttggaaatg tttccattgt tgtgattttg tgctattaaa 2514 0 

atgatcaaaa caacaccctt aaaaatctta ttctaacctc tcaagatctt ttaaaaatga 25200 

ataatttcag tacagtcgga tgcatctgta aaagataaaa atataacatt gattagtttg 252 60 

caaaaataat tgtttgaccc cagttaagag atgtactagt caaatttcag tttgactaat 2 532 0 

tattaatgtt ataatttacc taacatcacc aacagtacac ttcctccact ggcttaacag 2 5380 

attcctcagc aatatcttta ttagtcatta agaccaagga tcaaaataaa ttaagttaga 25440 

ttagccctgt gaactgctat atctctaagt ggtagacaag gttttcaaaa ctaagaagcc 25500 

atactcaatc atatttctct tgaaataata tttttagtaa gagaaaaata tttttcaaat 25560 

catgaatata ataaaattat tttttaaatt aagtacattt cagctctata tgcctttttt 2 562 0 

aaaggctgtt tccagttttg gagaagtttt acactgtata taatctatga gtttagaatt 256 80 

atatgggttt cagttataaa taaatataat tttgggcact tgatcaagtg ttttattggt 25740 

aggagtgaac ttcagacatt tgaaaacagc tgcccagaga atccttagga gggtgattcc 258 00 

ccagcacagg tcaggagatg caaatcactg tgctctgata ggggtctttt taaaaggcac 258 60 

tttctcatgc agttaggcat ttgataaagc attaaatt.at gtatacttta atgggaggga 2 592 0 

ataaaatttt attggcatat attgcttgta ttaaggaaaa ttggttagaa attttgctaa 2 5980 

ctcttctgtg agtttctcta aagacatcat aaaatgtttg tgattaagaa catttagagg 26040 

agtaaacttt attgctttat tttaaaatct agaaattgtt ttaattaatt ttctaataat 26100 

ttgtaccgct catcagcaaa acagggattt ggacatgttc atcaatgcct ccaagaattt 2 6160 

caacctcaac atcacctggg ctgccagttt ctcaggtaaa gacataccta gagaagaccc 26220 

tgcaaatgaa ggtgtggtag attaagaaat gtaatatagg aattgagaaa gcgagctcag 26 280 

gagacagatt ggtttgaaac ccacccttgc cacttactag ctatgagacc ttgagcaagt 26340 

atctaaatcc ctctctaaac cttagcatta ttttattcat ctgtaaagtg aggataatga 26400 

tacttacctc ttagaattgt tgtatagatt aaattaggtt atacctacca gagcttgctg 26460 

tggtgtcagg ctcagtgtgt ggttactacc ctagcccacc accaccattt ctgttcttgc 26520 

tgtggccact ggcactacca tcattgtcta catccgtgct tcggaagtga aaaattcaaa 26580 

tgattcgttt caataaatga aaacatttta aataaaatga gattttagta ggtacagaga 26640 

aatgtaactt gggaattaca tcaagctcta aaagcacagc tcttgctgtc tgccttactg 26700 

tgattcactg aagatctact gtatagaaaa tctaaagaaa taaaggatga aggccaagtg 2 6760 

ctgtggctca tgcctttaat ctcaacactt tgggagactg aggcaggagg attgcttgag 2 682 0 

ctcaggagtt taagaccagc ctgggcaaca tagtgagacc ctgtctacaa aaaaaaaaaa 2 6880 

aaaaaaaccc aggtgtagtg actcatacct gtaatgccaa ctactcagga ggctgagatg 26940 

agaggatcac ttgagcccag gggttggaga ttgcactgag ccatgatcgc accattgtac 270 00 

tccagtgtgg gcaaacagag caagatccca tctctttaaa aaaaaaaaaa aaaaaagaaa 27060 

aaaggataat cactacttaa cttgataact caacaagtag atatgggttt gaaatttgtc 27120 

cattaaattt acttgcaccg tgctgttagg caagttactt aaggtttctg agccagtttc 2 7180 

ctcctgtata aagtaggata gtaaaaacac cgtcctggca gggcgtgata gctcatgcct 27240 

gtaatcccag cactgtggga agccaaggtg ggaagatcac ctgaggtcaa gagttttgag 27300 

accagcctgg ccaacatggt gaaaccctgt ctctactaaa aatacaaaaa tcagccaggc 273 60 

gtggaggcac gtgcctgtaa tcccagctac tcaggaggct gaggtaggag aatcgcttga 27420 

accttgaagg tagaggttgc tgtgagccga gatcacgcca ctgcactcca gcttgggtga 27480 

cagagtgaaa ctccatctca aaaaaaaaaa ataaaaaaac accttcccaa gtagagtgat 2 7540 

gtgagaatta aatgagataa taaatgaagt actcaatata gtgcttgaaa tgtggtaaat 27600 

ggtaactata ttttatcatt attactatta caatactggg tttttaaaaa tcaaaaacac 27660 

aaagcaatga gattgatgca aaataagaat attgccttgt gcacgccact tacgtttatc 2 7720 

atcttaaaac attgtgtaga atttgagaaa agttcagaaa ctctcaatga ggagggactt 27 780 

ttaagaaaaa gtctgaatta tcagagtatt tggagaaagg caacatctcc aggcatgtga 27 840 

aagatttgca atgagccggg cggtggctca tgcctgtaat cctagcactt tgggaggctg 2 7900 

gggcaggtgg attacctgag gtcaggagtt caagaccagc ctgaccaaca tggtgaaatc 2 7 960 

ccgtccctac taaaaataca aaactagctg ggtgtggtgg ggcgtgcctg taatcccagc 28 020 

tacttgggag gctgaggcag gagaattgct tgaacctggg aggcggaggt tgcagtgacc 2 8 080 

caagattgca tcattgcact ccagcctggg caacaaaagc gaaattccat ctcaaaaaaa 28140 

aaaaaaaaaa aaaaaagatt tgccatgagt gtctcaatga agacgfcgata atgtgggctc 28200 
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tagtcacagg gtctaactca gacatggaaa 
ttgaattcct ggctaaggga gaatgtggaa 
gttataccag attaggggga cttaaatact 
gtgacccctt gaacccctcc tccctgctac 
attactggca tgggagtctg gcgcattgcc 
ttgaagtggt gtgggcaaag tgaatgccag 
tattttcatg atcatgtgtg tcacctgttt 
atggattcct ggtaaaacca tttttctttc 
gctcttattg cctaggctgg agtgcaatgg 
cccaggttca agcgattctc ctgcctcagc 
ccaccacgcc tggctaattt tgtattttta 
gctggtctca aactccgacc tcaggtgatc 
ttacagatgt gagccaccgc acccggccag 
gtatatcagc ctgcccagct ttaaatctta 
gggaagttac ttaatttctg tgaacctcaa 
gggcagctgt gaggattaaa tgagattgag 
atgtttgata aatgctattg tccttattat 
ccccccaaaa aatgtctctt ctctttcatg 
gagaagagat gcctgttgtt tcaaaaacca 
atgagaagtt tgattttcgc aaccacccaa 
tcacctggcc catcaaaatt caggtaagaa 
gtgaaatctc tttagtaaga ctaaatttac 
caatgtggct ttgcctttgt aacgtgtatg 
acagaagatt ggagtgggga tggactgtat 
ttgaaagcaa gttggcatgt aacagatttg 
agcatttgcc atgtgaaagc ctgtagggct 
ggatgtaagt gaaacaaacc cacatataac 
gagatgatcc cagaaatgct atcaaagttt 
agaacatcag ccttctagtc agacagaggt 
acagcctttc caagtctcca gttctctgtg 
tgtgcagctg ctgcagtgtc ttgttagcct 
gtactccctc ttgtccgaga cccatggccc 
gggcctttcc tcaatacctt ctgcttctta 
gggataggtg ggctttcaag gactggttca 
ggaggctgtt gctgtttgcc aatgttcaga 
caaacatatg aaaaaagttc aacatcactg 
gtgagatacc atctcacacc agtcagaatg 
tgctggcaag gctgtggaga aacaggaacg 
gttcaaccat tgtggaagac agtctggcaa 
tttgacccag ccatcccatt actgggtgta 
aaagatacat gcacatgtat gttcattgca 
caacctaaat gcccatcagt gatagactgg 
gaatactatg cagccataaa aaggaatgag 
ctggaggcca ttatcctcag caaactaatg 
tcactcgtaa gtgggagccg agaacacatg 
gcctattgga gagtgatggg gaggaaggag 
ggcttaatac ctaggtgatg agatgatgtg 
tatgtaacaa acctatacat cctgcacatg 
aaaaaagcaa gttatcactc atcaattagg 
gatctttcca ttctggaaat atggaggaca 
tctctccttt ttcctgtctc catggtctaa 
tttgccctcg catactccct gtgtccctcc 
tcttattcct gttttctctt gtctagtcac 
ctcaccacat cctgaaactg tcctgtcata 
cctttttatc tcattgcatg gctttgaatc 
tttcctcgtt ttctggccag tctttttagg 
cttgtctagg attcagacct ggtccttttc 
ttgtggtgcc acatctattc tggtgattct 
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aaagtccatt tcattaatct ttatcggcac 28260 

cattgaagga ctctctggga ataggatgga 28320 

gtggtagctg gtggtagaag ggaggactga 2 8380 

agtgggttag gcagtgagcg gtacatcagc 2844 0 

aaggaggtgt aaaggggaaa tgcaaaggaa 28500 

tgcttgttaa taggattcta gtggtatctg 28 560 

gggggtgggg caagggtgga agggagttac 2 8 620 

tttctttttt tttttttgag acggaatttc 2 8680 

cgcaatcttg gctcactgca acctctgcct 2 8740 

ctgctgagta gctgggatta caggcatgcg 28 800 

gtagagacgg ggtttctcca tgtttgtcag 28860 

cacccgcctc ggtctcccaa agtgctggga 2 8 920 

taaaaccatt ttggttaggg gcataggctt 28 980 

tctccatttc ttgttggctg tgtgacttga 29040 

tttcccagtc tataaatgaa gataataata 29100 

ctcttaaagg ctattactgg aacacaggaa 29160 

taatgaggcc agattctgtt ctccccctag 2922 0 

tttttctttt aacagctgga acccaggctg 29280 

acattaagga gtacaaagat agtttctcta 29340 

atatcacttt ctttgtttat gtcagtaatt 29400 

gaggcttttg gtctcatacc tgcaaaggtg 29460 

taatttggag cttgtggtaa atgagatgtg 2952 0 

gcagaggagt gctgagcaca tgcatgctgc 29580 

cactcatgaa agacatttgc aaaagcactg 29640 

gtcattataa acgtattcac ttcttcagtg 29700 

acacaaagaa cttcaattct agagtaaggt- 29760 

aactgaaagc cagagtgtgg aaagtgacat 29820 

aaaggaggac aaatggggag actatgttga 29880 

ggattgattc ctggctccta tacaaatcat 2 994 0 

catgtgacct gaagggtagt tgtaaggggc 30 000 

gctcctttcc tctgttccca gggggccagt 30060 

cattttaact ttttatactc atgtcccctg 3012 0 

ccttcttcat ttaggtgaat gtggaggtta 3 0180 

cctttaacca tggaagcatg gtcactggac 30240 

agcataatca acctcagaag caagt caeca 30300 

atcattagag aaatgcaaat cagaaccaca 3 0360 

tctgttatta aaaagtcaaa aaagaacaga 3 042 0 

cttttacact gttggtggga gtgtaaatta 30480 

ttcctcaaag acctagaggc agaaatacca 3 054 0 

tatccaaaag aatataaatc attctgtaac 30 6 00 

gcactattca cagtagcaaa gacatagaat 3 0 660 

ataaagaaaa tgtggtacat atacaccatg 3 072 0 

atcatgtcct ttgeagggae atggatggag 30 7 80 

caggaaegga aaaccaggta ccacatgttc 3 0 840 

gacacatggt ggggaacaac acacactggg 3 0900 

agcatcagga agaatagcta gtggatgctg 3 0 960 

tgtagcaaac cactgtggca cacgtttacc 31020 

tacccctgaa cttaaaaatt aacaataaca 31080 

atgccttggg actgtgacta aagaacagtt 31140 

aaaactatgt tgtagttttt ctcaccaccc 312 00 

gatttgttaa tcctctcatc aggttctccc 31260 

tcagccagtc ttgtaagcat cagccacgct 31320 

acatctgctc acaatggtct ggcctcttcc 313 80 

gtgggcccca gtgatggaat ttttcagtcc 31440 

atcatctttc ttgtcttcct ggatttttct 31500 

gaatcctctt cctcttcctt aaacactggg 31560 

tctcttcact ctgtttcctg catgtgggac 31620 

cagtgctgtc tccagaccgg cactggcagt 316 80 



WO 01/78894 



PCT/US01/12245 



16/154 

tgctgcctgg acagccctac ctgggtagtc aaagctcccc gcgttcggtg catattcctt 3174 0 

cttgtgatct tagccccaga cttccatgct tcctttctct gtaaacagca ccaccatcac 318 00 

cctgttgtac aagctagggc ctgatgatca tttgcattcc tgggataaac ccaacttgag 318 60 

catgacgtat tatttcctta ctttacacgt tgttggattc agtttgctaa tatttaggtt 31920 

agaattttgt atctatttca tgagtaagaa tgtctgataa ttttcccttc ctgtccttgt 31980 

tatggttttg atatcatgtt aaactaactt ttaataatta taatttgagg agttttcact 3204 0 

ctttctcatc tctggagatg taagattaga gttaactgaa cctcaagtac ttggtagctc 32100 

tctcctgtaa aatcatttga tcctagtgtc ttatttgtgg agatactttt tagctgctga 32160 

ttcagcttct ttaatagtta taggatattt tgaatttcct ctttatttga ttcacttttt 32 22 0 

aatatttttt ctaggttttt atatcatgtt tttaaatgta tttttaattt tactttttaa 32280 

attgatacat aattgtacgt atttatgagg tacatgagat attttgatac atgcatacaa 32 34 0 

tgtataatga tcaaataaga gtaattagga tatccatcac ctcaaacatt tatcatttct 32400 

ttgtgttgag tacatttcac atcttctagc tattttgaaa taatgaataa gttattgcta 32460 

actatagtca taatgactat tgtgccattg aacactagaa cttattcctt gtaactctat 32520 

ttttataccc attaacctct atctccccag ccccccagca gctggtaacc accattctgc 3258 0 

actctacctc catgagatca gtttttttag ctcccacatc tgagtgagaa aacatatcta 3264 0 

tctttctgtg cctggcttgt ttcacttaac ataatgacct ccagttccat tcatgttgct 32700 

gcaaaggaca ggattccatt ctttttgtga ctgaataata tttcattgta taatatatac 32760 

aacattttat ttgtccattc atttgttgat ggacacgtag gttgattcca tgtcttcact 3282 0 

cttgtgaata gtgctgtgat aaacatttag agcatgcagt atctctccag tatactgatt 32880 

ttctttcttt tggatatata cccagcagcg ggattgctgg atcatgtggt ggatctatta 3294 0 

tgagcagtct tcatactgtc ttccatagtg gctgtactaa ataatttaca ttcccaccag 33000 

cagtgaacta gtatcgtctt tctctgtatc ctcgccagca tctgttattt tgtcttttta 3306 0 

ataatagcca ttttaactgg gatgagatga tttctcatta tggttttgat ttgcatttgc 33120 

ctgatggtta ctgatgttga gatttttttc atatgcctgt tggccatttg tatgtcttct 33180 

ttggacaaat ctctattcag atcatttgcc catttttaaa tcaagttttt ttcctattga 33240 

gttgtttgaa ttgctggtat attctggtta taaatccctt gttggatgga tagtttgaac 33300 

atattttctc tcattctata agctgtctct tcactctctt gtttcctttg ctttgtagag 33360 

ctttttggct tgatataatc ccatttgtcg atttttgctt ttgttgcctg tgatttcgac 3342 0 

atcttacaca aaacatcttt gcccagacca atgtcctgaa ggattttccc aatattttct 334 8 0 

tctagtagtt ttatggtttt aggtcttata cttaagcctt taatccattt aaatttgatt 33540 

tttgtatgtg gtgagagaga ggggcctagt gttatttttc tgcacatgga tatccagttt 33600 

tcccagcacc atttattgaa gaacctgtcc tttcccccac tgaatattct tgactccatt 33660 

attgaaaacc agttggccgt gaatatgtgg atttatttct gagttcttta ttttgttcca 33720 

ctggtctgtg tatctgtttt tatgctggta tcatgctgtt gtgggtacta tagctttgta 33780 

gcatattttg aagtaaggtg atatgatgcc tccagttttg ttctttttgc tcagaaatgc 33840 

tttggttggc tgagtacagc agctcgtgcc tataatccca gcactttggg aggccgaggc 33900 

tggtggatca cctgaggtca ggagttcgag accagcctgg ccaacatggc aaaaccctgt 33960 

ctctaataaa aatacaaaaa ttagccatgt gcagtggtgg gtgcctgtaa tcccagctac 34 02 0 

ttgggaggct ggggcaggag aatctcttga acccgggagg aggaggctgc agtgagccaa 34 080 

gattacgcca ctacattcta ccctgggcaa acaaagcgag actctgtctc aaaaaaaaaa 34140 

agaaaaaaaa aaaaaagaaa tgctttggct atttgaggtc ttctgtggtt ccatacaaat 342 0 0 

tttaggattg ttttttctat ttctgtgaag aatgtcattg gtgtagagat tacattggat 3426 0 

ctgtaggtag cttttggtag tatggttttt ttcacaatat taattatttc agtccacaaa. 34320 

tgtggtgtct ctttcaattt ttttgtgacc tcttcaattt cttacatcag tgttttatag 343 8 0 

ttttccttgt aaagagggct ttcacctcct tggttagatt tattcctacg gttgtttttg 3444 0 

gaagttttta aaaaatctat cgagaatggg attgctttct tgatttcttc ttttgctagt 34500 

tcgttgctcc tatatagaaa tgattctgat ttttgtgtgt tgattttgta tcctgcaact 34560 

ttactgagtt tgtcagttct tagtttggtg gagcttttag ggttttctgt atataagatt 3462 0 

atatcaactg caaataggga cactttgact tcctcctttc agtttggatg ccctttattt 34 6 80 

ctttcttttg cctagttgct ctggtcaaat atgttgataa cttgttgatg ctgtcctcta 34740 

atgctcgcag cgtctgtgtt catgaaaccc tttgttgtat tccaatgttg atagcattat 348 00 

tcacagtaat caaaatgtgg aaacaaccca aatatctatc agtggatgaa tggataaaca 348 60 

aaatgtggta tgtatgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtata 34 92 0 

ataaaatgtt attcagcatt aaaaaggaat gaaattctga tacatgcgac aacacaagtg 34980 

aaccttgaaa acatgctaag tgaaataagc tagttgcaag aggacaaata ttgaatgatt 3 5 040 

ccacttaaca tgaaatatct agagtagtac acagattctt agagacagaa agtggattgg 35100 

aggttaccag cagctagggg gagcgggaaa tagggaatta ctgcttcatg gttattaaag 35160 
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agtttccctt tggggtgatg aaaaagcttt ggaagtaggt agtgatattg gttgcacaaa 35220 

attgtgaatg taattattgc cactgatttg tacacttaaa aatggttaaa gtgattttat 35280 

gttatatctt actacaataa aaaaagtctt taaaaatctc agcaagatgt ttttgtagat 3534 0 

atagacaagc ttattctgaa atttatatga aagggcaaaa gttctagaat agctataaca 35400 

atttagaaaa agaagaataa aatgggaaga atcaacctac ccaatgttaa ggcttactat 35460 

ttaggtacag taatcaagac attgtttcag gtagaggggt agacacacag atcaatggga 35520 

taggctgatc tataggtaga tcccagaaat agaaccacat aaatatgtcc aactgattag 3 5580 

gatttgattt tgtttgtggt gtttttttga gacagggtct cactatgatg tccaggctgg 35640 

tcttgactcc tgggctcaag taaccctctc acctcagcct cctgaataac tgggattaca 3570 0 

ggtgcacacc gccacgctag ttccagctga attttttttt tttttttttt tttttgagac 35760 

agggtctcac tttgtcgccc aggccaaagt gtagtggtgc agtcacagca cactgcagcc 3582 0 

tcaacctccc tggctcaagt gatcctcctg cctcagcctc ccaagttgct gggactacag 35880 

gtgcatacca ccatgcctgg ctaattcttt tttttttttt tttttttttt tttttggtag 35940 

agatgagatc tccctgtgtt gcctagtctg gtctcaaact ctcaaactcc tgggctcaag 3 6000 

tgatcctccc gccttggtct cccaaagtgc taagattaca gacatgagcc gttgcgccca 36060 

gcccccagct gatttttgac aaaggccctg caaaggcaat tcaatgaaga aaggatagcc 3612 0 

ttttaacgaa tagtgctaga gcagttggac aaccataggc aactaaagga atctctaaat 3618 0 

ctcacatctt atataagaat taacgcaata ttgatcacag atctaaatgt aaaacataaa 3624 0 

actataaaac ttttagaaaa aaatatagga gaaaatcttt gaatcccaaa acgaaacaaa 363 0 0 

agctttttag actcgatact aaaaacgtga ttcataaaag gaaaacttga taaactggac 36360 

cacataaaac caaaagtttt tgctctgtga gaaaccctgt tatgaagatg aaaaggcaag 36420 

ctatggactg ggagaaaata tttgcaaacc atatttctgg gaaagcactt gaatctagaa 3648 0 

tataataact ctcaaaactc aacagtaaac aaacgaacaa ttcagttaga aaatgggcaa 3 654 0 

aagacatgaa tagacattta accaaagagg atatacatat ggcaaataat gacataaaag 366 0 0 

atgatcggca tcaataacca ttagagaggc caggtgtggt tcacacctga aatcccagca 3666 0 

ctttgggagg ccaaggcagg cagatcacca cttgaggtca ggagttcaag accagcctag 3672 0 

ccaacatggc aaaaccccgt ctctactaaa aatacaaaaa ttagccaggc atggtggcac 36780 

atgcctatag tcccagctac tcaggatact gaggcaggag aatcgcttga acctgggagg 3684 0 

cagaggttac agtgagctga gattgtgcta ctgcactcca gcctgggcaa cggaatgaga 3690 0 

ctgcctctca aaataaaaat aaaataacca ttagagaaat gcaaattgaa agcacaatga 36960 

gataacacta cacacttatc aaaatgacta aaattaaaaa tagtaataac accaaatgtt 37020 

ggagagagtg gggaaagatt ggatcactca tacattgttg gtgggaatat aagatggtac 37080 

agtcactctg gaaaatactt tgtttcttaa aaaactaatg ttatacttat catataaccc 3714 0 

agcaattgca ttcctaggca tttatcctag agaaatgaaa accagtgttc tcacaaatct 37200 

ctgtacaccc gtgtttatag cacttgaaag caatatttaa cagctttatt tataatagcc 37260 

caaaactgga aacaagccaa atattcctca acaaatgaat gcttaaataa atgatgctac 37320 

atacacacca tggcctaata cccagcaaca aaaagaaatg aactgttgat atacagaaac 37380 

acaacaactt tgataaatcc caagcaaatt ttgctgattg aaaaagaaaa tcttagaaat 3744 0 

ttatatactg catgattgca tttatataac atttgaaaat gaaaatttta gatcttatat 37500 

acctatgcac atattgaata agtatcctga agcttcaaat ttctctgact tcttaggact 37560 

ctgatctcta gaccctgttt taaaggagct tatctgatta ggtcaggcct acccacacag 37620 

gagtagggaa gatacagagc ggtgcccatc aagaaggtgg aaatcttggg ggcattctta 3768 0 

gaattctccc tgctgtaggc tgcatttgga atagagcttg caggggtttc ggtagtttct 3 7740 

ccgatccgaa aaaggttgag ctctaggtcc cagtcagtat aatttcagtt tggcaacatc 3780 0 

tcctgacact attaagatat agattatata tatttgttag gttattttca tgtaaacact 37860 

tgattagggt tctagtaaat aaaatttatt tatttactca ttttaagttg agattgtttt 3 792 0 

tgtactgaca tacctctcca tattctcaat ctaattgaac atcgtgcttt gttgcttata 3798 0 

gctgacaaat gattaatact ttttagattc ttagtgaaaa tcttgactta tgctgctgac 38040 

acttttccca gtctgtgcct ataagcaggc ttttgtgctg gagatggcca tacttgttgc 3 8100 

caatgccgag tttttttcct cccagttccc tattgctcca ccctctgaaa gtcactgaag 38160 

tggttttaga gctttcattc tagagggagc tcttccccac acttacttct aagaagtaat 3822 0 

aattttaagt ttcataatca aggagatgtt gattttcttc ctggttttct ggtgctcctt 3828 0 

atcttttctg gatcactaac ttaaattaag atccacttat gaatgctgca ttcaggagca 38340 

cttgtgaaaa taaacgatgg catgcccacc atttagtttg ccaagctttc cttgaaaacg 38400 

caggtaagtt ctattgcaaa cgtgtctttt tgctgtttgg tacaaatgac tgcactattc 38460 

actgtgacta ggcccgtgtc accgtgtgtt ctcagtcctc gcatgagagc ctggttgaca 38520 

ctggttctct ctgctttggg gaccgtatct ggctgtgcgc tatgtggagg gctgagctcg 38580 

tgtcttcctg ggtaaataca tcatgttctc aacaagggct gcatgtttct ttattgctgt 3864 0 
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tttttcccca gagacttagc ttttctaggt gtgtagcatg gctcttaaat acacacattt 3 8700 

agcaactgtg caaattcagt aaaaatggag agacttattt aaggaagaca gtagggaaat 3 8760 

gtctaaatag tgcagtgtag gccattcagt taaatttcag tatggctttc gatgcaggca 3882 0 

aagccgtttt gcctttgatt ctcatttttt aaattagatt ttaaaaattg agattctcct 38880 

tttttaaatg agattttaaa aattgagatt ctcctttttt aaatgagatt ttaaaaattg 3 8940 

agattctcct tttttaaatg agattttaaa aattgagatt ctcctttttt aaatgagatt 39000 

ttaaaaattg agattctcct tttttaaatg agattttaaa aattgagatt ctcatttttt 39060 

taaagcaaag taactgagga aatctaattc tctacttgga aatttgtgta agcattgggg 3 9120 

ctttaaaaat ggtaatacta aactcaagtt ccttcactct gaaccaatcc caggattctt 39180 

ttccataatt aggaacagat caaaacctat aactgctgag ggaatttgaa actgactata 3924 0 

aaaatctttg tagatgcagc taaaaaacaa acaaaaccca cagactgaaa agtcctgttt 39300 

ctgggagggt ggcgggtgtg ttggggccgg ttgtgcctgt tttttgattg gttgttcctg 39360 

tgggtgagca cccttccctc tgcccctgct ttctgaaagc tatcctagtt gttctcccca 3942 0 

gtaagccggg gctcagccgg cttcagctat gtccatagtt tgggggcgtg ' ggcggtaggt 39480 

attcaagata gaagatggtt ggacagaaag tgtgtgaggt gggaaggtgg agtggtgagg 3954 0 

ctctggctcg ttcccgactc tgcaacagag gttgttactg taatgtggag ctaatataga 39600 

agtctgcgtg ctgggtgttt catctactgt tttatgtaag aggccagtgt tcacaagtaa 39660 

aggcaacaca gaattggtat tgagaactgg gcattcattt agattttaga gttctgttta 39720 

tgggtgctaa caggatcgct atactgagca agggatgtgt gaaacttttg ttcacttgga 39780 

tgtgcttttt ttcttataac agctttattg agacatcatt tacataccat acagttcgct 3984 0 

catttaaagt acacaattca gtggttttca gtgtattcac agaagcgtgc agccacccgt 3990 0 

cactacagcc aattttagaa tattttcacc acttccagaa gaagccccta gcagtcagtc 39960 

ctcattcccc cctcttccgg caacccccag tctgtatttc ctgtctctgt ggatttgcct 4002 0 

attctggaca ttttatataa attgagtcac atgatatgct ctcacttagc atactgtttt 40080 

caaggttcac ccgtgtttag tatgtttcag tacttattcc ttgtcatagc tgaataatct 40140 

tccttatatg tatataccac gttttgttta tccatttttc tatcgatgga tgcttgggtg 402 0 0 

gttttcactc tttggctatt atgaatagtg tcagatatac ttttaatgtt tatatttatt 40260 

gtataataac agtgcatatc cgtacctcct ggtgttaatt gtctgtatct taagagtttt 40320 

aagaattata tggttgtgct gtaattctag cacttaggga ggccgaggcg ggcagatcgc 403 80 

tttagctcag gagtttgaga ccaggctggg caacatagta aaaccctgtc tctactaaag 40440 

tacaaaaaat tagctgggcg tggtgaagtg ttcctgtagt cccagctact tgcaaggctg 40500 

aggcacaaga atcacttgaa cccaggaggc agatgttgca gcgagccaag attgcaccac 40560 

tgtactgcag cctgagcgac agagcaagac tcctgtctcc aaaaaaaaaa aaaatttttc 4 0620 

tttatgatga aacccacaag gctgtacaag aaccttaaat atagtttgaa acaaagaaag 4068 0 

acacatgctt ttaagtacag tttaattctt ttcttttctc ttatttcatt tttttttttt 40740 

ttggagacag agtctcactc tgtcgcccag gctggagtac agtggtgcaa tcttggctcg 4 0800 

ctgcaacatc tgcctcctgg gttcaagtga ttctcgtgcc tcagccttgc gagtagctgg 40860 

gactacagga acacttcacc acgcccagct aattttttgt attttagtag agacagggtt 4092 0 

ttaccatgtt gcccccatca ttctcagcaa actatcgcaa ggacaaaaaa ctgaacaccg 40980 

cttgttctca ttcataggtg caaactgaac aatgaggacg catggacaca ggaaggggaa 41040 

catcacacac cagggcctgt tgtggggtgg ggggaagggg aagggatagc attaggggat 41100 

atacctaatg ttaaatgacg agttaatggg tgcagcacac caacatggca tatgtataca 41160 

tatgtaacaa acctgcatgt tgtgcacatg taccctaaaa cttaaagtat aattaaaaaa 4122 0 

aaaaaaagaa aaatgtgtta ttcctttgtc tcgaaagtcc cctgcttttc agcttttgct 41280 

ttggttgtct cttgtcagct ctttaatgct attgttccct tgggtttcat ctggactctt 41340 

tttccctggg caatctcatc cacaccatct gggaacattg cctttggctg gacaggtaaa 414 00 

tctttgtctc tagcagactt ctctcctgtg atctccagag ccaggaaccc actagacttc 414 60 

cttaaaattc attcagccag cagactttct ggcccagagc agcagctgtg agtggacagc 4152 0 

aggcacctcg ttagaagctt ggtgttggag gtgtgcccat gcattcctgg gagccatgag 415 80 

aggggagggg agtgccctgt gtcacctgag tccaagttaa gagagatcac accagagctg 4164 0 

attcctaagg agcatggggg cgggggaagg gggtgttcgt ggtggggaaa ggaaatgacc 4170 0 

agccctttca ggaactacca ggagtttagc agagctggtg acagtgctat gagcaagaag 41760 

gtggagagct ggggagggcc attatgaaga gactggagat tgcaggctgt gaacttgagt 4182 0 

cctgatgtga ctgggaattg ttgagatgtt ttaggccgaa aagtcaccga cgatcagtgt 4188 0 

ttagaaagac cccactggca gcagttggga gtgtggattg gaaggagcca ggcctaagtc 4194 0 

ctgggaggac ttaggaggtc caggagtcca gggagtccag gtggaattct tgtgtagaaa 4200 0 

cccagggaag gggaccatct gatgacacac atcccatagc aacttccttt cctcctctgc 42060 

acctccacag cagagtgtct gtttccacta tacgaagatt tcttaatggc aaggatcgca 42120 
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tttttatccc tcagtgtcct cagtgtcatg tgctaatcag ttggagacac tccctaaatg 42180 

tatgccagat gagtatatag ttgttagtat aagggcttta cggtaggtat cagaactttt 42240 

tcttttttac aacttttata acttttataa aggagccgac acctttttct gatattgtca 423 00 

gaacaattgc tgaccagtgc tatgggctgg ctgaggttgg caaccaagaa tctgttatct 42360 

cagtctacac agcgtggagt tagaagattg gattgttgtt ggttagaatt ttgtccagag 42420 

agttagaaga tagaagaaaa gggagggctg aggacataaa catccaaatt ttacaaagaa 4 2480 

gaaagttcag tgctcctgtc tatattgaaa tagttaagag ggaaatgatt gggtctgggg 42540 

aatgcttcaa aataatatga tcatagggaa agtaagtgta gtgtggatga aacaaggttg 4260 0 

gccttgagct gataatttac taaagcagga tgatgagcgt gtgaggttca ttatagtgct 42660 

ctttctactt tttaaaaagc ttaagatttt ccaaaatgaa aaagaagctg agcatggtag 42720 

ctcacgtctg taatcccagc actttgagag cctgagacgg aaggatcact tgaagccagg 42780 

agttcgagac cagcctgggc aatatagtga gacccccatc cctactaaaa aagaaaaaaa 4284 0 

attacacaat gaaaagaata atacaggtgg tagaagaaga aataaaggta tgagtatatg 42900 

atttggcctc ctacagacag ctacaggagc atgcctctgc tgggcgggcc tggagcatag 42960 

tgcaccctgg agctgaagtt ctgttcgcct ttacaggaag tacacagcat tgtacggggg 4 3 020 

ttttaccagt taacagtgga cagtctctag aacactcagg ggtcaggggc aaaactccct 43 080 

ctccctcaaa ccagcccacc tgtcttggtg agaatggaaa acggagcttc ccagcaagcc 4314 0 

tcctgccttc accccgacgc acaggtgcat cagggtctga gggctcctcc ctgtggggcc 432 00 

tgggaagccg cccttcagtg ggctatggct gcattgtctt ctgatcttgt agctgtgtgt 43260 

gggtaacagg cacatcgtct ccccatatcc cagagagtca tgtccacttc ctatctgtgg 43 320 

tggcatcttt ccatcactgt caccctcagg ataatggcac aggaactaac agaggcacta 43380 

agagcagtga tatatactgg agggaggggc ggggctgatg accaaaatca ccttctgttt 4344 0 

cagtaagagg ttggttaaag tgatgcctaa aatggataga tcaggaaata acaataaaaa 43500 

cattaatgat aaaaaggcaa ctgccagaag aactgaaaag taacctagtt caaggtgggg 43 560 

gtccctggag agaagccttc agaagagcag aggaagagca gggaacattg cttttcatga 4362 0 

ctagcctttc agtgcctttt gatatttcag ctaagtatgt agttctctaa tgatttctaa 43680 

attgttaatg agtgtgatag aataggaatt ctggagaatg gtcaaaagga attgctagga 43740 

gaaaattagg gaaagaaagc acagagcagt attgaggtgg gaggttggga gcagacagat 43 8 00 

ggtgagtgtt cgtttgcatg gcaggaatga gtggctgaat ggcattaact ggccatgggg 43 8 60 

ctctgaccag agcttgggta tacagtttct gtgccagcac aaaccagcat taggtgcgtg 43 920 

gtgtgtgagg ccgggtgatg gtcatatgga tctggtccag agctgtggtc ttgacataga 43 980 

gttccatgta tgaactttgg tggtggtggt ggttttgttt cgttttttaa gaaaaaaaga 44 04 0 

agagttgttt tatgtcacag ccaaactcaa ggaggtgaca gcacaagacc caggttagaa 44100 

tgtacttagc cagggctcac ggaatcttcc atcctttgca aagccaagaa gagtagaggc 44160 

ctcaaggaga atagaggagc acatgggctc cttctgggct gagctctcac ccttggtggg 44220 

gccgctggtc ccagggggtc gttgggagcc tgcggagatt ctgcccagta catacatacc 442 80 

ttgcgttttc tcggagcctg acaggtgctg ccttgaagcg gaaagacttc acctcacagg 4434 0 

gtggccagct gtcctttctc ttcctctccc tgtcctgtgt gctcatgata atggatagat 444 0 0 

agacatctcc atggtagaaa taatgcagcc ctttctgcag tccatggttc cacagcatat 444 6 0 

ggtcagtaga cttagatgag tgctacataa agtgtaatta gttctgcacc acggtgtctt 44520 

accttctgaa aaggaaactc tggtattgac ttagaaatgc cagttttatc cttccgacca 4458 0 

gaacgcaagt cctgaatgag gccagaggcc aatgaaaggt cctttcttct ctcttttcaa 44640 

atctgctgta ttaaacagtg acttttccta gtcattcagt attcattcat tcatccattt 44700 

attcaagttt ttgagtacct tctttgtgga tggcacatag tagatgcaaa aatacaatca 44760 

tttttaaaat ggaaagcatt tgttatgtgg tagtggaata taagaaatgc catgcttcca 44820 

tatttttatg gtaatcttgg actcaggttc tacccaccat acttaattag aaattcattc 44880 

tcagcctctg ttttggcaga gacccattct cttccctcag gaacatacag aaacctcagg 44940 

gctgaactga gctcagaggt tggggaaggt cccctggtct gatcttcagt cgctgactgc 45000 

tgcgcccgat tccctggccc ctcactcttg agggcagcat cctcactctt gtgtggcaca 45060 

ggcttggaca tagaatgaca cttaagcacc agagcctccc tagcctgcca caaggacacc 45120 

ctacattccc aggcactttc agaatgcaaa acctcccctc cattccctcc ctttccctgc 45180 

ctccaggttt gggccctgcc tctccttttg tttcaaagca ctgtcctctc ttcctctacc 45240 

cttatccttc cccatctccc tcccactccc caaaagagcc aaactaatta tggggattta 453 00 

accttccaat taaaggcgag gatttttttt tttcaatgca gttcctgaaa tgctaaagaa 453 60 

attgtccagg gaaaggagca caggaggagt tttgttttgt tttgttttgt tttgttttgt 45420 

tttgttttgt tttgttttga gacagtctcg "ctctgtcacc caggctggag tgcagtggtg 4 5480 

cgatcttggc ttaccgcaac ctctccacct cctgggtttg agcaattctt gtgcctcagc 45540 

cccccaagta gctgggatca caggtgcaca ccaccacgcc agctaatttt tgtattttta 45600 
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gtagaaatgg ggttttgcca tgttggccag 
ccacccacct cagcctccca aagtactggg 
tgtttaaata atttcttact tcagagtttc 
gaatcactgg aggagcttaa agatatagag 
ttgtaagcta tccagagttt gagagccatg 
gctcttgtat gggaggtagg aagttacctt 
aagttaggaa ttggtatctt catagacaga 
tgtctttaca cagccatggt atatggtaga 
tacatgcagc ccaggacggc tttgaatgca 
catgattttt ttattttatt tttttttaag 
agtgtatttt atgtgtggcc aaagacaatt 
ttggacaccc ctatatagac cataattatc 
tgagggagcg tagaggagag cctgccagag 
ggaggcagtg atgtgctgag ccagaaagga 
aggctctctg gtgacatctg ccgtcaggta 
acttttggct gaagtgcaac attagtgatt 
caaaccctgg ggctcactga gcttccgagt 
cagaaggccc atggggcaga tacaccttag 
gttgtggaag tcgtaggacg tgaattctag 
tgtttggaaa cagagaagaa tgacattatg 
ttgtattaaa atagattttt atataggaaa 
ccaaatctat ttcaccgttt ccataataaa 
tctttaaaaa tgaaaccaaa aagtatggga 
tatagttaca atcatgaagc aggtatggtg 
aaaaaatcag atcttactga gttctgtgag 
catttattca gcaaatgttt attaagtgcc 
tggagatata tggtcagtgg gataagggga 
ttttccataa gtaagtgtac aaatacgggc 
gggggtgtgg cttcagttgt gcacatggtg 
ccttcacttg agacttcaca gcaggatgta 
tcccaaagtc ctcccttaat ctcgtttaga 
ctttatccct tcttcagttg ataactgaag 
tggttttgtg cgagcagttc tccaacagcg 
ggattttaca gttggttggc attgtatgac 
tttgttaatc aagtttatgc cttttacatt 
taacagcact ctccttcatc ttcagaagtg 
tcagtcaaag gttgaccaac aaagactgac 
atgtttctaa gtggtcagtg ctttcataaa 
cacaatcttg taactatgga gacttagatg 
agtcttgggt accagccata ccaaaagctg 
tggacatgaa catggagtgg tcctgcttca 
ccagtgtggg taaaggcggg ggctgccctg 
caccagataa gtcctcagac agtgccctta 
agtaaaagta tctcatagat tctttaagtg 
aagattgaaa tccagacagg gctggggtgg 
gtgttataac ttggattctt cattccccaa 
tatctaaaac ttttatgtga ggaccatggc 
agctaggact ggttattgtg ataaatgaag 
tgtatgaaat tgatgtaact ttgtatttct 
aaaaacttca aattccaagc cccataatat 
tatttatgga atgtcaaaaa gaaaataggg 
tttctggccc ctgtgagtct ggctctaaaa 
taggaaagca gatgctcgtc atttaccttc 
ggattgtcac cttggtcagt aattgtttcg 
ctgagttccc taaaaaaaag caggcagcgc 
tatcggtgat agtaaggaca caattctcgt 
cgtgggctaa tatccctgaa atcatgatgc 
tgtcatagtt aaattgacag tgtttttcct 
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gctggtttcg acctcctggc ctcaagtgat 45660 

attaggcgtg agccactgca tccagcctga 4572 0 

ctgaagtgtg gcctatggag actagcatca 4 5780 

acagggatag ggcctgagat tctgactttg 4 584 0 

gcctagttct tggaatacaa aagtgtatct 4 590 0 

ttcattcatt cttctacaac agtaatttaa 45960 

agaatgcagg aatcctcttc tcaggatgtc 4 6020 

ccaagtttgt ccaacccatg gcccgcgggc 46080 

gcccaacaca attcataaac tttcttaaaa 4 614 0 

ctcatcagca ctatcgttag tgttagtgtt 4620 0 

cttccagtgt ggcctaggga agccaaaaga 4 6260 

acacgccctg agagggaagc agatcaatgg 4632 0 

tttccagtgg ggccggggat gcttaaaaac 4 6380 

cagggaggac ccccatgcac acaggcaggg 4 6440 

cagtagtaca gtagtagagt gagcggaagc 46500 

gaggaagtgc aggtccgtaa tatcttaccc 46560 

tcagagtttt tcagaattga tgaacacatc 4 662 0 

acatattgat gaaagtggag ttggaattgg 46680 

gtttcttgga gtgtggagta tggtgtctac 4674 0 

agttttattt ataaaataag aatttcttct 46800 

ttttcattac cttcagtatc agaaaggccc 46860 

ttgtgaaaga aagatttgat tgggatattg 46920 

aaata'cttta attttagagt taacaacatt 46980 

ctgcttcatg agccaagttc ataaaagaaa 4704 0 

acagatcatt atttttctta ttaacatatt 47100 

aattaatgta tcagtggctg ctttaggcac 47160 

agtctctgtc ttcaatcctc acagatctta 47220 

tatgaattaa atagagcaag ataaagagat 47280 

cccagaaatc tgatagctcc atcatggagt 47340 

aaaagcaaca gccaggaaac tccttagttt 4 7400 

gaattatgaa acacttccta tggcttcatc 47460 

ggctaaatct gccaagtctt cctttgccaa 4752 0 

cttccactaa aatcttgtct tttacaagat 47580 

agtctgtgag tagggacagc ccaatgtatg 4764 0 

gagtaatttt taattacctc gtgtaatttt 47700 

ttcagtttga acaataagtt tgtcatcttg 47760 

cgcagtgggc ggggtattca gcagggttag 4 7820 

cattttcacg ttatgacctt tgcctcctca 47880 

ataaagattc aactaagagg atccctttaa 47940 

ttttatatga ggttctttat gattataatt 48 00 0 

gcaagaaatc aggctaataa ccattagcat 48060 

aggagtgttg tgtggcacag ggcgcatgaa 4 8120 

gcatccagtg gtttttggca tctagcaggg 48180 

taaacaccga ggacacactt aacgttttaa 4 824 0 

actctaatct gtttccttct ctctctttat 483 0 0 

aatgaccttg tttatataat tctccagtct 483 60 

tcatattccg cagctttcct tctctcaagg 4 842 0 

gaaaagaaaa gaaagtgggt ttctgtttat 48480 

gattataaat ggtagccaga gggaggacac 4854 0 

attatggcct gaatagtgtt ggaatttatt 48600 

ccatcaaaaa agcaaacagt ggaagtctat 486 60 

acagtgcaag gggttggggt ttggtgggac 48720 

ctgtgtttgt catgagaatt gtctgtactg 4878 0 

tttatagtac cattgtctct ccaaagaggg 4 8840 

accacaacat ctggctagta gcacgtatat 4 8900 

atttaatcaa tttggaggta tgcatttttt 48960 

atcttatggt cagtgactta aaaagcatta 49020 

aaattataca cagaataatg cctttttcag 4908 0 
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ccaatgttat tttagagttg atgaaatatg ttgcattttt atattcttgg tatgaagact 49140 

attaactgta taataaaatt atttaagttc tgatgacgta gtgtcagatg atgttcagtg 492 00 

tagcaaaacg cgaggcacaa ttttttgtaa aaaggacaat atcttgcagt cagtacacac 49260 

tgagtacata aatgttgacc aaatgaagag cacagaatgg ggctgcccac ctggaggcag 4932 0 

agccatgaat tggtactgag gcaaacaagg agagctttgt gtgtgggtgg aggcgcaggg 4938 0 

agggagcgcc aggcttcatg gaggcatgcg agggtctcca gggtcaaaca gacagaacct 4944 0 

cagaactggt gtgaagctta gaaggggctg caggaggctg tgctaaagcc tcagtgtatt 49500 

gctatgggac tggcaactaa acaagtaaac taaaggggtg aggtactttt ccttaagtga 4956 0 

aaaaagtttc cttaacttct ttaaagtgta agctctgttt tcagcttaac tcttgaaatg 4 962 0 

agaagcgtga atccgtggtg gtttcatgac ctgcataatc acactacctg tttttctgtc 4968 0 

acagattgcc ttctctcagc acagcaattt tatggacctg gtacagttct tcgtgacttt 4 974 0 

cttcaggtaa ttttgcttat acagactctc tctgttcttt tgatttaggc actagatcca 4 9800 

ttaatagagt atcagctcct aaaagataaa aagaataatc ctgtgtttcc accacagatt 4986 0 

aaatttactt catatgtgtt cctgaaaagt tagatgtcag ttacagtaat ataaactcaa 4992 0 

tctagcatga caaagtgaaa acgtttggtt tgtatgttct ggcttccagc ggggggcttt 4998 0 

gcatgtatcc tgtattaatt ttgcatgcat cctatagata atttggctct tcatatgggg 50040 

agaatttgga ctaaaacaac atttaattaa taatggttac tgatgtgagc tacacatatg 5010 0 

cttatgcttg ttcaagagat gattctggtg attatccaaa gttttaaagc tgaaagtgat 50160 

tttttagtgc cctgggattt tcttttattt ccatagtttt tttcgtagtc taattttatc 5022 0 

tttatgaaaa tactatagac aaataattga aaaagacaaa aaacgtgctg cagtttttaa 5 028 0 

aacagaaagc agcaatagcc tgccttgcgc cttccccacc ccacgtcccc caatttaaat 5034 0 

tgttttgttt tttacttctc tgtgtttaca cttggttttt cagttaagtt ctcttttgat 50400 

gtcctactgt agaagatgag gatatactgt gttacatggc ctcccctagc atctcccctc 50460 

acaccacaaa agcatacaaa agccccatct ctttcattta ataccattgt atcataatgc 5 052 0 

atggctaaat cagtgatccc tatctatgtt atggccataa aactcttatt cacagctgca 5 058 0 

ttgaacagtg attacactac cacatttttt gggtagaagg atcattcttt gcttatccta 50640 

aatgtagtat ttgccatgtt ttggcttttt aattgtttga tttttctatg tgcctacccc 50700 

tgatttatca ctatactctt gctgatgtac ctttagaaca gtaaaaaata aaataaaata 50760 

aagtatcatt gtactgtgac agaactgaga aatctcacaa tggtcacaca caggaaggtg 50820 

ctctcttggt ttgaatttct ctcggagttg cccaccctag aaccttccca cctcctgccc 5088 0 

ccaagtgcac ttgttcctga ggcctgctgc acaactgtca tttggaaact tcctttgcct 5094 0 

tcctcctata ttgaagtctc tgtttcccag atcccatgtc ttcaccttcc tatccattcc 51000 

ctctttttac tggagcacat tctctagatt tatgagaaaa gatgcctggg agataacttt 51060 

tctgtgagct ctcctgactg aaaatatatt tattttatcc tacaatttga ttgaatttct 5112 0 

gggggcaggt gctatggctc acacctgtat ttctagcact ttgggagacc aaagtgggtg 5118 0 

ggtcacttga gctcaggagt tcgagaccag cctggacaac atggcaaaac cttgtctcta 51240 

caaaaaatac aaaaattagc tgggtgaggt ggcgcaagcc tgtagtccca gctaatcagg 513 00 

aggctgaggc aggagaatgg cttgagccta ggaggcagag gttgcagtga gcggagattg 51360 

caccactgca ctccatcctg ggtgacagag tgagaccctg tctcaaaaaa aaaaaaaaaa 51420 

aaagtctgac tacagaattc taggttcaaa atcattttcc attagaattt agaagacgtt 51480 

atactatttt ttctagtgtc cagtataata ttgagaaggc tagtgctgtt ataattccca 51540 

ctctttgtat gggatccgtt ttctctctta aaaagaaaag tatatttaag tgtaatataa 51600 

agttacagaa acatttgcaa atcctatgtg tgtagctcag tgaattatca caaattgaac 51660 

acttctagag tattgataat attatctctt gttcacccag aacacttcag gcttcttatt 5172 0 

tcttctctgt cagtttcggc aagttgtata tagttttctt ctaatgatcc cattccatct 51780 

gttccattct cttgttcact gagaacaaga gttaatatta ccaatactct agaagttcct 51840 

cttgtgtttc ccctcactca caacctcctg cctcttcccc aaaggtacca tttacctgac 51900 

ttctcaaacc atacattgca tttgctattt ttgaaatgca tatgaattat ttcttttttt 51960 

tttttttttt ttgaaactta gtctctctct gtcacccagg ccagagtgca gtgttgtgat 52 02 0 

cttggctcac tgcaagcaac cgccgcctcc cgggttcaag caattctcct gcctcagcct 52 0 80 

cccaagtagc tgggattaca ggcacgtgcc accacgcatg gctaattttt gtatttttag 52140 

cagagacggg gtttcaccag gttggccagg ctggtctcga actcctgacc tcaagtgatc 522 00 

cacccgcctc tgcctcccaa agtgctggga ttacaagcat gaaccaccgc gctcagtcaa 522 60 

aatgtatata aatgaaatca caacataaat tttaagatga tttggttgat gaagaaaata 5232 0 

ctgaaaactt gtgggttgca gttaaagttg tgcttcgagg ttatagctgt aaatctgtgg 523 8 0 

ctctccacct tagctgcacc tagaatcact tgcggagctt ttgatgtcca agccactctt 5244 0 

cagaccattt gactcagaac ctctaggggt gcaccccaaa catttgaaaa ctctccaggt 525 0 0 

gattccaatg tgcagctaag gttgagaatc actgctttaa aagcacatat tagaaaagaa 52560 
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aaaacattcc tctcaataac tcagaactgc taattaaacc taaagaagta gaagaaagaa 5262 0 

aaatataaat aagagcaaaa attaatgatc taagggaaga agtacagtag agaggatctt 52680 

aaaaactgtc agttggatct ttgaaaagag taagaataat attgacaaac ccctggcaag 5274 0 

attaaccaag gaaaaaagag agaaggtata tggtatatgt agccaatgtc agaaataaaa 528 00 

aagggggcat tcctccaaat cccgcagacg tttttttcct ccagaagggt atttgatgaa 52860 

caactttatg tgaatgaatt tgaaaatgaa acagatggaa tgggaccatt aaaagaaaaa 52920 

tatgcaactt gccaaaactg acacagaaga agtaaggagc ctgaatagtt ctgtatccat 52980 

ttttaaaaca acactaattt aagttttgtc atagtgaaaa ctctaggccc caggtggctt 53 040 

cattggtaac ataacctttt taggccgggt gcagtggctc acgcctgtaa tcccagcact 53100 

ttgggaggcc gaggtgggtg gatcacgagg tcaggagatc gagaccatcc tggctaacac 53160 

ggtgaaaccc catctctact aaaaatacaa aaaattagcc aggcgcggtg gtgggcgcct 53220 

ataatcccag ctactcggga ggctgagaca ggagaatggc gtgaacctgg gaggcagagc 53280 

ttgctgtgag ccgagatcgc gccactgcac tccaacctgg gcaacagtgc gagactccat 5334 0 

ctcaaaaaac aaaaaggaga atactgtgat catcttaatc aatggaggag ataaagcatt 534 00 

tgataaaatt taatacccat tcatttttaa aaaaatttct tagaaaacta gaaagagaaa 53460 

cttctggaac actgataatg gaacaacata aaaagtcaac agcaaaccac tttcttaatg 53 52 0 

gtagaatgtt agaagtttga gtttgggaac aagacagaga tccactttcc acagtttcaa 53580 

ttactcaggg tcaaccgtgg tccaaaaaca ttaagtggaa aattccagaa ataaacaatt 53 640 

tataagtctt aaattgtact ccattctgag tagtgtgatg aaatgtcgtg ccatcctgct 537 00 

tcgtcccacc tgggatagga atcatccctt ggtccagtgt atccacgccc ctatgctacc 53 760 

cacccactag tgtcttggtt atcagatcta aaaatcatag tatatgtagg gttcagtact 53820 

atccatggtt tcacgcatcc actgggaggc tgggaatgta accccacaga ttaagcagga 53880 

cactatattg ccaaatccaa aggatacttt ttggttctcc tcttcttgac ttctcggtga 53940 

tatcaaccaa ctgactgctt catgagcg'ag actcttccag ccgtctctga cccatgctgc 54000 

cctactgctg cttcttagcc tgcctattat gcaatctttt ctattcagcc tctcactgtt 54060 

gattttttcc cagaccttga ccctggccgt cttctcactc tgtgttctct gactggactc 5412 0 

tgccttcccc atctgcagca cgacatgccc agaagtgacc aattccatcc ttcctcaccg 54180 

gtgttcccca cctcactaac tggcaacatg gctctgctgg ttccaagcca cttctcttgc 54240 

tttcactcag caggcactcc cttaccaggc cccagagctg tggccgtcag tgcactctcc 543 00 

acctccactc cccaccccac caaccctagc atcacagtct cgcctggacc ttttcacttg 54360 

actttggccc ctccagaatc cacaaaaagt ccagagaaga cctttaaaga tcatgtatct 54420 

agccccttac cttcctgttg tatctaaaat aaaattcagc gtcctgggtg aactggcgtc 54480 

cgcttttcca tattcagctt gcactcttct cttccacatt tctgctacca ggaccatctc 54540 

ctatgtcttc aaaagcacaa gctctttccc tacactgagc atttgtacat gcttatccca 54600 

ttttctggaa cactcttctc actacccttc attctgtata ctttcatacc ttgcttgatt 54660 

tttccagaca acaacaaatt ctccaattct ctgacaccaa ctcagtgtct cacagttcag 54720 

ttccattcag acactaccca tagttggagc aaatcctgca ggttaggggc tcagtcccac 54780 

aagactgctc ccacttcaga cactaagcac aaatgggtcc ccaggctacc tcactgctac 54840 

tgggccaact acaaattcag ggcttctggt gactgctcac tccaagattt gacaatttgt 54 900 

tagaacagct cacagaactc aggagagcac tatacgtaga attacagttt tattatacag 54960 

ggtacaactc aggaaccgcc aaatggaaga catgaatagg acaaggcatg ggggtgggtg 55020 

agcacagctc tttcatgccg tctctggaca agcctcctcc ctgcacgtgg atatattcac 55080 

tatcctggaa gctcccccaa gcctcatcat tctgagtttt tttattgagg tttcattaca 55140 

taggcatgat ggatgaaatc actttctatt ggtgactgaa ctcatctcca gcccctttcc 55200 

ccttcttgga ggttggggga tggagctgaa agttctaacc ctctaaccac ctagttgggt 552 60 

tttctggtga ccagcccccg catctgaggc tctttaggcc cctctcccca tgagtcattc 553 20 

attagcacac aaaagacact cctgtccctc tgaaaattcc aaggggtaag ctaaattgtc 55380 

tccccttgtt cctcactcct gtagaactgt attatttttc atcgtggtat tcatcacaat 55440 

ttgaagaata tattcatatg tttacttgct taatatctgg gtcggtagcc caagcttcag 55500 

gggagcagtg cactgtctct cgttcactcc tatacacaca cagcttcagc atctagtaca 55560 

taatctagta agatatttgt gggaagtaaa cttagactct tcagggtaag tgcagagtga 55620 

ggaggaacca taggcaccaa cctctcaggg aaaaaagctt gtgtcaatgg cttgctttta 55 680 

agggatgtag gggctgtatt gtagctccag tagacatcct taaattcttg gagcatgtgt 55740 

atggaataag atggccctgc tgccaagtac gtacagcctg gtgaggaccg gccacttcta 55800 

ctggacattc agtccctggg cgaccgaatg acactatcag aataagtcga ggaagtactt 55 860 

ttcaggccca tggcctcttg tgtagtaaac aaggacagat ggcattttca tcacttctcc 55 92 0 

cttgtggggg ttgaacttct ggaaaagaat ccggaacttt ggtttttcta gacgatgttt 55 980 

actaactagc gatgataaac ttttagctaa gaatagagtt actttcatgg catggggaag 56 040 
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aaaatatttt cccaggacaa gttaaccatg tctaaaagaa tttatttagc tttttttttt 56100 

tttttttttt tttttttggg atagggtctc tgttgcccag gctggaacac agtggtgcag 56160 

tcatagccca ctgcagcctc aaactcctga gctggtctca agcagtcctc ctcacctcag 56220 

tctcccaagt agagctggga ctgcaggcat gcaccatcat gcccagctaa tattttcttt 562 80 

tttgtagagt tgggttctca ctatattgcc caggctggtc ttaaacttct ggcctcaagc 56340 

agtcctccca tctcagcctc ccaaggtgct gggattacca gcgtgagcca ctgctcccag 56400 

cctggtccaa aagaatttaa aatggagcct ataaggaaaa ggggaaaggt atccagataa 56460 

tcctaagcac tctgaagaat ggctcatata atatatggaa aaaagaacaa aagtcggggg 5652 0 

gaaattcgat gattcaaatg agaaaatcaa tgtaaagtga gttagaaaaa atatttccag 56580 

actccgggca cagaaaagta aactctaggt cctgacacag tctgttgaga ctgtgtccta 5664 0 

atgagatgct tttccagcct caggaaggag cctccttaaa gagggaaata agagactctc 56700 

ctctgcttgc ttttctgtag cttcttgtag ctggagcagc aagtagaagg gactggggaa 567 60 

ctggtgttgt caacagatct tcactgaata cagaagggcc ctctcacctt ccctgaggca 56820 

tagagggaag caggcacaga ctgtattacg ggtggaccca agagaggttt ctgagtgaca 568 80 

ccagggtcct cgcctctcag cagcagcggc aggctcttga ccaacttaga agctgttgcc 56 940 

gttggcttca tgctcttcct tgtggatttg ctgcctcctt atgtttatct attagaataa 57000 

aaataacaaa aatcaggatt atgcgtggaa gccaagaaag actgagaagg ttattctcag 57 060 

acctctttac tggttcttcc tgttctcctc aacttctcat tgttagggtg gccccaggac 57120 

attctcatct atctgtattc actctcttgg tgaatgtggc caggtttatt gcttgaaata 57180 

ccatatatat ccttatgcct tccaaatgcc atgcccagcc cagacctggc agccagcctc 57240 

caggttagaa tatctgccca ggacatctcc cctgggctgt ccggtagata tctcagggtt 57300 

ctgcatgtct aaaatggatc tactgatatg tccagattcc aaagtgcttt tcacacctct 573 60 

gcttccacag ctctggtttg agccaccatc atctcaccta gactgctggt ctccctgata 57420 

atgccattgc acctctaatg gttttctcag caaacgttaa a'tcacaacac tctgtttaaa 57480 

aacctgcact ggctctttcc caatgccctc ataatgcttg cagtgcccca catgatctgc 57540 

ccttcaccct cacccttctc tctgcctttg tctctggcct tacacacccc actccaaaca 57600 

tgctggcctc ctgaggctcc ccaagccatc ccaggcacgg ccttgcctcg ggctttgcac 57660 

tgactgctcc tttttccaga cgcacttttg caggcgtcca tgagccacct cccatacctt 57720 

ctccaagtct ttgttcaaat gtcaccttct gagaaagaaa ggcccacaca ctctgaccat 57780 

cctgctgtgg agtgccactg gtactcctga accccttgct cttagcatct ttttatttct 57840 

gtagcactta cctacctttt ggcatgctct gtaatttgct tatttactta ctgtttgtct 57900 

ccacctgctg gaatgtaaat tccatgggac agtaatcttt gctctgttta tcagtgtttt 57960 

ccatgagcct agaacaggac ctgacccaat taaagtattg attaaaaaga caatgtaggc 5 8 020 

cagtcacgtg aggcctgtaa tctttgccaa ggcaggcaga tcacttgagg ttaggagttt 580 80 

gagaccagac tggccaacgt ggtgaaacct cgtctccact aaaaatacaa aaattagctg 5814 0 

ggtgtgatgg cgcacacctg tagtccaagc tactcaggag gctaaaagag gagaatagaa 582 00 

tcgcttgaac ctgggaggca gaggttgcag tgagctgaga tcgtgccact gcactccagc 58260 

ctgggtgaca gagcgagact gtgtctcaaa aaaaagacaa tgtagactgt cagaagagtt 58320 

ggcagtgtcc tgatagcaga gaggcccaga gatgaaggct ttccaccaag tgacgatcct 583 8 0 

gggg999ttg gaaggcaatc ctattgaatt gcctgttgtc agggcaccca ttggaggagt 58440 

ctgcaccctt gtacccctta gggaggaggc agtgagcctt gagataaggg cagttcagtc 585 0 0 

atcctagcat cttagagtcc ctctctccag ccagcccagg agggctgtgt tggtgtcagc 58560 

agttgggtgc actgaggtgc cattcatgaa gctgacccca tcctacaaac gcagagctca 58620 

gccagcaaga gagtaccttg atgacactgg cagagagcct tatgtctaca tgtagcactc 586 8 0 

tggaaaaata ccctgtagct agcccctttt acaagttaat gagtaaaaat gcagttatag 5874 0 

cagttatcta tttttttact tgttaagctt attcttcctt ttaaccgcaa gtcagctgtt 588 0 0 

ctctttgcca tcactgtagc ttttcaaatc attccacacc tggtttatac atgagagagc 58 8 60 

attgaaatcc tctgcaacta ttaaggccac tgggagttat tttctcttaa tatttttaag 5892 0 

aaagttgttg gcttgggttt gatcatttca aatgtgtttt atbttactta aaaatatgta 589 8 0 

tttgtaaaaa tattgaatat acactaataa ttacacacat ttcttctgtt tgcctcttcc 5904 0 

ttatgaaaag ttgattgtta ctttaataat taactttttc accttcttag attttaccag 5910 0 

taaatatctg ctaacttgaa taaacagcat gactaatatt tgttataatt atcttagaat 59160 

tatttttctt atttgggata agccactgtg ggccattctg atgaaggcca agtggaaggt 59220 

cagggctctc tctgaatacc acatgggaac gatacagtct agaggcctgg ttaggaagcc 59280 

tggcttaccg aagttgtctc actcattacc tcatgagccg gcacattgtt gtgacaaagc 59340 

atcagaaggg agcccggagg tggcatgtgg gagctgtttt cactgcctgg gcctgaggtg 5940 0 

tgggcaacgg tgggagctga ttctgggggc tgggataata actcagtact ttcctttctc 59460 

acagttgttt cctctctttg ctcctggtgg ctgctgtggt ttggaagatc aaacaaagtt 59520 
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gttgggcctc cagacgtaga gaggtaagct tcagtgggta aagattaaag aatccctgga 59580 

agagcttttt ttccttcttt ttctcttaag caagtgggtt ttagctattt agtgataatg 5964 0 

gacagacaga catctccacg gaaggaataa tgcagccctt tctgcagtcc atggttctgc 59700 

agcatatggt cagtagactt agatgagtgc tacataaagt gtaattagtt ctgcaccaca 59760 

gtgtcttacc ttccgaaaag gaaactctgg tattgacttg gaaatgccag ttttatcctt 5982 0 

cagaccagaa cccaagtcct' gactgaggcc agagatccaa gggcgtatta gcacagccac 59880 

cgcgtgatta ggcaccgcct tcatgagaac ttgtcatgcg agagggaaat cagccattta 59940 

agcatctcaa aaatttttca ttattcaagg aaagataaat gtgtgtgtag ttaagttttt 60000 

aacttgagtt ttatttttaa ataagtcttg atttgtttcc agagcttcat tccaaaagta 6 006 0 

ataagcaaat atttacagct gacaaattta aataaataaa cttggttgca aattacaaca 6012 0 

tatcaaatgc ccatcaatca atgagtggat aaagaaaatg tggggtgtgt atgtgcgtgt 6018 0 

gtatgtatat atatatattt atatacattt gcagcaacct ggatggaatt ggagaccgtt 60240 

attctaagtg aaataattca ggaatggaaa accacacttc atatattctc acttacaagt 603 00 

gggaactaag ccctgaggat gcaaaggcat aagaatgata cagtggattt tgaggactcg 60360 

ggggaaagag tgggaggggg atgagggata aaatactata catcgggtac agtgtacact 6042 0 

gctcgggtga taggtgcacc agaatctcag aaatcaccat gaaagaactt attcatgtaa 604 80 

ccaaacacca cctgttcccc caaaaaccta ttgaaataat aaataaataa ataaataaat 6054 0 

aaataaataa atattttaaa aataaccacc attaaaataa ttacaacata aaaactttgt 6060 0 

gtattgcaac tttcacttgc atcattactt tgcctttcat ttagtcatgc ctgtggcatt 60660 

gttggcattg ttgctaatgg aaatgctcac gtgtatataa gatctagaca aagaagctgt 60720 

ctggggtttt tttggatcta ctcatgcctt tttctatata aaaatgtatt tatagataag 60780 

atttagaaca gaagggaagt atgtaaaatc acaactcaca agcactcagc actgttgata 60840 

ggatattcat gtctgaatag ggttaagaca ggctccagga tatgggcctc ttgttgtggg 60900 

ccacagtacc accttttctg acccacataa agatgatgtt atcatagagg gaatgttgca 6096 0 

cactctttat tattattttt taaataagtg gataacatat tcaaagggtt ttttagataa 6102 0 

ctgtattact gatatacttt aaaagcctaa tgaattacgt gatgttctga aaaaaattat 6108 0 

attttcaatt taatttaaat gtatatttat tcacaatgga gtaagaaaga agtaagaagc 61140 

cacagtaaac cagttatcaa gaaaggagtt gaaaactttt gagtgctact ctctccaaga 612 00 

aggacattag acccagggag tttcaaggtt gagatgagct tatggagatt atgatagcct 61260 

gttcctccct gagtgcttat gagaaataat tgtttctaaa atctgtaaag caaaaggaca 6132 0 

gatggaaagc ttgaaagagt ccgaattgat tttatacaca taaatcttgg taccaaaacc 61380 

tgacaaaggt agcaccaaaa tacaaagggc agctgcatag tgaatgtatg tgcagcagtc 61440 

taattgcagt gtaatgaaag agtcacttaa taccagaaaa cctattagaa tatatcatgt 61500 

caaaaaaaga atatatcatg tcaatgggga aaaaataaca cctgaattct tcctagtggc 61560 

ccagaaatca tttcgtcatt tattgaaact caatatccat tcctaatttg aaaacaaaac 6162 0 

tcacagtaaa atcaaaaata cctttttaac gctatgaaga atatctggat aaccagcatc 61680 

actcatgata aaacaaacaa acaccagagg ttgtatccag gctagttagt tttaacaaga 61740 

gctgacatct gagcatttaa aaattccagg cactgagttc taagtggttt tacgtagatt 61800 

aactaatcct tatctcaaac ctatgaaata ggtattgtcc ctcgttatgt tttatggatg 61860 

aggggactga ggctcagaaa ggtgaagtga cttacaccac atagttaact aagaggcaga 61920 

cgaggcttta actagggact tctgtcttca cagcccaagt tcgttatctt cgtgtaatcc 61980 

tacagtcatt cagctttgtg ctagatgttc tagactgtgc actaagtgaa agggcactaa 62 04 0 

gaagtgttgt tactattgga aaggaaacaa aattattatt ttcatataaa gttattaaaa 62100 

caatgtagta gagatttgcc tgcataaaaa tgaaggctta agtatatcaa aaaataaatg 62160 

aataacacag aagctaaaca atcaactgga aattgtttgc agtgtgcatg agagaaagta 6222 0 

agatgcacag cctacaggaa aaagattaaa at at at at at gtgtttagta tggagttggt 62280 

catctaataa actatgcttt tacccgtgtg atttggaatt tctctgtaac tgtgaagctg 62340 

accattcaga atatatttca tcatttttaa aatgtcagcc attatttatt ccctctcaga 62400 

attgtagaag caatgatttg atgttatttt taaaaacaat agtagtagcg gctgacgctg 62460 

tttagtcctt gctacacacc agccccggtt ttaagtgctt ttacacatgt tgcctcctga 62520 

accctgtctt attgtgaccc cagcacttaa gcacagaccc caaggagcat tgctgcatat 625 80 

attagtcgag taaatggacc tacagcttta aaaaatgtaa gaatgttggt attttaagaa 62 64 0 

aatatttata aaaatgccct tctagaacag agccatatct atccaggcca agattctgtc 62700 

ttctacagta gttccaagga ggcagtttga agtaaggcct acttgtatcc caaggcataa 62760 

cctcaaagct tcagcatcat gcccagaaag atctctaata tccttaatga tatgtgaacg 62820 

gctgttcacc atgtgctagg cgctgtgctg tagtatctca tttgatccca gtgacaacct 6288 0 

tatgggctag gaacacattt cagagggagg acacggatgc ccagagactt caaacacctt 62 94 0 

tcttaaggtc acacagctgt agactattgg agcaggagtt cacatcaagc ctccctaaat 63000 
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cccgagcccc tgtttgtccc tctactctgc aaaatttcac atttgtttgt ttttagctta 6306 0 

taccacctct ctgaaggaag ggttctttca gttcctatct gccgtataat gaggcatagg 6312 0 

tttttgttgt cctgtgaatc cttggtgcca gtttcagtag gaagaaagca agcctgactt 6318 0 

tagtagtaga aatacgagga gatagggcca caccatttta tgtatttgac aagtcgacgt 6324 0 

catagcatta aatccacagt acttttattt ttatctttat tttcttattt tactttaagt 633 0 0 

tctgggatac atgtacagaa catgtagatt tgttacatag gtatacatgt gccgtggtgg 633 6 0 

tttgccgcac ctatcagcct gttatctagg ttttaagccc tgcatgcatt tggtatttgt 6342 0 

cctaatgctc tccctcccct tgctccccac ccctgacagg ccccagtgtg tgatgttccc 6348 0 

ttccctgtgt ccatgtgttc tcatggttca actcccactt aagagtgagg acatggagta 6354 0 

tttggttttc tgttcctgtg ttggtttgct gagactgatg gcttccagct tcatccgtgt 636 0 0 

ccctgcaaag gatatgaact catttatttt atggccgcat agtattccat ggtgtatatg 6366 0 

tgccatgttt tctttatcca gtctatcatt gatgggcatt tgggttggtt ctaagtcttc 63 720 

gctactgtaa atagtgctgc aataaacatc gtacgttaat atgtatctct gacaggggct 6378 0 

tttaaaaaat ataaggacaa tggtgttatc acatccaaca aaatttagta atccctaatg 6384 0 

ttatctaata tcaagtctaa aaagtatcat tttggacttg atttgtttga atcagggagg 6390 0 

atgaggtgtt tttaacatgt agcattaact aataaaaccc cttcagtact cattccatta 63960 

atggctttgc catattattt atcagcaact tcttcgagag atgcaacaga tggccagccg 64 02 0 

tccctttgcc tctgtaaatg tcgccttgga aacagatgag gagcctcctg atcttattgg 640 8 0 

ggggagtata aaggtgagaa tgtgactcag aagtccctat aacttgactt tttaaaactt 6414 0 

aggctcctaa gtctgggaaa ccagagagag caaaagccct attccattac ctcctttttt 642 0 0 

tcctctcttt ctttttaaca ataagctaaa acctgaagtt gctgggtact cttttgcttt 6426 0 

tgtttttcaa tctttgtttc atgtcgctga gcagtcttgt gttcagggga ttttaaaatc 6432 0 

taaatgattt gtctgctttt cttttaagta ttctgtggca tacaaaactt ttatttgaaa 643 8 0 

agagctacat tgaacatcga atcattgttt ttttgtttgt ttgtttgttt gagacagagt 6444 0 

cttactgtgt cacacaggct ggagcgcagt ggcatgatct cagctcactg caatctacct 645 0 0 

cccgggttca agcgattctc ttgccccagc ctcctgagtt gctgggatta ccgatgtgcg 6456 0 

caaccacgcc cagctgattt ttgtattttt agtggagatg gggtttcacc atgttggcca 6462 0 

ggctggtctc gaactcctga cttcaagcaa tccacccacc tcggcctccc aaagtgctgg 646 8 0 

gattacaggt gtgaaccacc gcacccagcc tatttttctt tatattttgg ggaatttttt 64 74 0 

aagcagcaaa atgataatac actgtgtatt aatactctgt ggaaaatatt tatcaccccc 6480 0 

ctccccaaaa tgatgaagaa aatggtaaga ctttctcctt gggcaaaatg atggaatatt 6486 0 

taagatacgc tggagaaata gctatgtatc ttgaataaaa catacttact taaaatgtta 6492 0 

ctgagctcac aggaagtagg taaaatctcc agggaatact ctccttccct accctaaaaa 6498 0 

gaacaaacca taaaccaaaa ccagaaccat gatttgtcct aagtggttcc agatggtaaa 6504 0 

cacaacgccc acctgaatag atggaaatcc tctctataga aaaacatctt caattcagcg 6510 0 

ctgttgaagt cccacagatc aagatgaagt aattaaagat caccagggac aagaggcaag 6516 0 

ccgccataaa taagagtagc agaaacaaca gatgattgat gtggacagga ctccaaatgt 65220 

tagaattgaa atctaaagaa tataatataa ctgtgtatga aatgtttaaa gaaataaaag 65280 

atgaaatcat aaagatgaga gtcaagtatg gccaagcaga tctgaagaag aaccaaatgg 653 40 

aacttctaga aatgaaaaat atacttgttg aaatttttta aataaataaa ctcaatttac 654 00 

atgttaaaca gcatatcaga cataagagaa tttacaaagt ggaagatgaa tctgaaaaaa 65460 

gtacgcagaa tgcagctcag agagacaagg agatagaaag tacaatagat actaagaata 6552 0 

tggaggatag aatgagaaga tccaactcat atatatttcc agaatcccag aacaaaggag 65580 

aggcaatatt cagaaatgat agctttccag aactgatgga aaacatgaat atacagattc 65640 

cagaagcaaa acatattgta agcatgatta aagaaagaaa gggaaggcag gaaggaaaga 65700 

aaacaagttg gatttctcat aatgaaattg tataactcca aagacaaaga gaacatctta 65760 

aaatcagcca gagaaaaagt atcacataca acgtggagaa actgaacgtt agtgcactgt 6582 0 

tgatggaagt ataaagtgat ggagccactg tggaaaacag taccactatt ccccccagaa 65880 

ttaaaagtag aattatatat gatctggcaa tctgggttta tacatagaag aactgaaagc 65940 

agagtctcaa agacatattt gaacatcagt gttcttggca gcattattca cagtagccaa 66000 

aaagtggaag caggcccaag tgtccatgga tggataaatg gataagcaca acgtggtcta 66060 

tgcatccaat ggaatgttat ccagccttcc aaaggaagca aattctgacc catgctacaa 66120 

tgtgatgaac cttgaagaca tactaagtga aatactccag taacaaaaaa ataaacattg 66180 

tatgattaca cttttaagag atacacttgt ggtagtcaag ctcataaaga caggaagtag 66240 

aacagtggtt cacaggggct gggagaaggg gaaaatgggg agttagtgtt taatgggtac 663 00 

gaagtttgag ttttacaaga tgaaaagagt tctggagatg gatagtaatg atggttgcac 66360 

aacaatgtga atatacttaa taccactgaa ctgtacattt ttaaatggtc aagatggtaa 66420 

atgttatgtg tattttatca caatttttgt taaaaatggg aaaagaggcc gggtgtggtg 664 80 
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gctaacacct gtaatccgag cactttggga ggccgaggcg ggcggatcac ttgaggtcag 6654 0 

gagttcaaga ccagcctggc caacatggtg aaaccccatc tctactaaaa atacagaaat 66 600 

tagccaggca tgatggcaca catctgtaat cccagctact tgggaggctg aggcatgaga 66660 

atcatttgaa cctagaaggc agagttcaca gtgagctgag attgcaccac tgtactccag 66 720 

cc tgggcgac agaacaagac tctgtcgcaa aaaaaaaaaa aaaaaaaaaa aaaagggaaa 66780 

agatcacctg caaaggaatg acaagctgac cgctaacttc tcagaagtca gaagagagtg 66 840 

agagaatgtc tttgaagtac tgaaagaaac ctccccacct agaatagtgt gcccagtaag 66900 

caccttccga gaacaaggat gaaataaaga tatcttcaaa ttagaataaa aattgagaat 66960 

ttgccaccag cagacctgca caaaagaaat gtctgaagga tgtgctaaaa tgaaggaaaa 6702 0 

ttacccccag gaggatagac taaagtgtaa gaaagaatag tgagtaaata aaagtataat 67 080 

catatataga aatctaaaca aacattatcc tggtaaaata atatttgtgg gttaaaaaat 67140 

agacaagcct aaagtattga acaacatgat atatgtcagc agcatatgac acagttagtt 67200 

gctccttaaa acattttctt gattttcttc caggacacca cacacttgcc ttctagctgt 67260 

tcttgctcag tggtctttgc tcagtcctct tcatcttttg acagtggtgt gcgtcaggcc 67320 

ctgcctttgt ccccttcttg tgtcttttac acccacttct tcagtgatct tcactatcac 673 8 0 

aggtttaagt accatgatcc tctcacctgg actgctccca tgagcctcca acacaaacat 6744 0 

cagttgccta ctcagtgtcc ccacttggat atcttaacat ggccatggcc aaactgagct 67500 

ttggatgttt atttctcaga cttgctcctc tgacatacct tccgcctttc agcacataac 67560 

acctccatca ccctagttgc tcaggccaaa atcgtggaat tgttcttgac tcattcttga 67620 

tgtcatgttc tcattggaca gccagtctat cagcaaatct tgactccgta ttcaaaaatg 6768 0 

tttcccagcc actgtgtccc atctctgcca ctattcccac ttgatctagg ccaccgctgt 67740 

ctctcacctg gattcttgca gttacttcat aactgatctc cctgcctctg ccattcaccc 67800 

atctagtcag ttccttgtgc cacagcagag tggtactgga aattattttc ttcagccaaa 67860 

ctgaagaaac tgatgacatt tgtgggtata gtcagcagaa ttctaaaggt ggccctcaag 67920 

atttccatcc cacgtttatt cagtcacaca ctactctagg tactgctaga gggattttgc 67980 

agatgtaatg aaacaaggga gttgacttta agatacagaa agtttctctg gctggtggca 6 8 040 

gaaagaggaa gccagagaga ttggaaacat gagaaggact tgacacacca tccctggttt 6810 0 

agagatggag tggccacaga agtttttttg tttttgtgag acagagtctg gctgtgtcac 6 8160 

ccacactgga gtgcagtggc atgatcatgg ctcactgtag cctcaaactc ttgggctcaa 6 8220 

gggatcctcc ctcctgggtc tcccaaagtg ctgggattac aggtgtgagc cactgcgtaa 68280 

gccctgaatg catttcaaag cagagtcttc cccagagtct ccaactaaga gtcccatcca 6 834 0 

gctgacaact tgacttcagc ctgagagacc tcaggagata gctgataatt ccccagaact 68400 

catggaagac atgaacagac ctgtctctct cggcagagaa ccctacagag cttgctggga 68460 

cttctgacct ctaaagctgt gagataataa gtgggtgttg ttttaagcaa ctaagtttgc 6 852 0 

agtagtttgt tacactaata taagctgggt gcggtaggtc acacctttaa tcccagcact 6 8580 

ttgggaggct gaggtgggag gatcgcttga gcccaggagt tcaagacctg cctggacaac 6 8640 

atagtgagac cccatttcta ttctttaaaa aaaaaaagga aaggaaagaa aaagagaaga 68700 

aactaatata ggggacaaat ctcatctaaa ggctgcaaca gaagatgaaa acatattttc 68 760 

cagagccaaa tgcatccaga gagtagacaa agaatctact tggtcagcca acccagcttt 6 8820 

tacaagcttt tacatccagc aggcttgtct acttagtgtc taaaggacag acaaaataga 68880 

atgtttttta agtctccatg atatctgcat ctgtgtttga gccaggtatg cagaactatc 6 8940 

tgacaaatct accaagcact caccactgcc attcctgtga cctacagtta tgaattagaa 69 0 00 

tttcaaatcg agccaaagta cagcatccag aagaaaccca cttggcgtgg ccccttccct 69060 

acagttcaag cagccctctg gaaggggact gcgagacctc agtggaaggt agcaagcagt 6912 0 

accacccgtg tcactggtgc tcagagagta ctgttagctc tctcctgccc tttttcatta 69180 

tcttaaaaac tgcaatgtac cttttttata tacctgaaat tgatggttct tataagcaca 69240 

aatgaccgtt aagttgtccg tatacattaa agcttgagta tcacttaagg taattttgtt 6930 0 

ttcttctgcc agactgttcc caaacccatt gcactggagc cgtgttttgg caacaaagcc 69360 

gctgtcctct ctgtgtttgt gaggctccct cgaggcctgg gtggcatccc tcctcctggg 69420 

cagtcaggtg agtagatgcg gtccagcgaa agacaccttc taagcatgtg agggagctaa 69480 

gcatgggata cttccctttc ctagagaaca aggttataaa ggtgataacc aacagtgcct 69540 

gcccatgcca tggcaaggat tcagtgagat aacccatttt cacctatatg cccaaaactg 69600 

ttttcccgag gtcatcgtcc cttcctgggg ggacgtcccg ggacctcctc accgttctgg 69660 

cctccccagc ctctcagtgc tcgtggatgc ctctgaccac accttccttt gctgtcggag 69720 

t'ttctggtcc ccctttgcct ctgctgctct ttccctgtct cttttgccct gtcctgcccc 69780 

tgaatctggg tgtgctccct tcaccctgca tgttcttcct cagttgcctt ctccactcca 69840 

caacttccag ccatctcccc tgggctggtg tttcttcaat tggtatttct agccctcctg 69900 

catttctttt gctttttcac tgttttaaat ataaggcaca aacacaacga taacataaag 69960 
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gaaacgtctg aaaatccacg cacagttctg 
ctgtgttctt tccagccctg gccagcaatt 
tattctgctt ttttcatata actatcttaa 
accaatgcat gtgacacttc tccatcaaat 
ttgaagacag aaaatatgga atacagtttt 
tggaggatgc ctcaatgtga tcattttaga 
gggttacctt agccagagag caccctgtgt 
ccaggccctg agctccactg tcccttcagt 
cagctttaga atggttagga tctatagtat 
ggccaggatt cattgtcttc atctcaccta 
gcettgcctg tgttgaggac aggacaactc 
ctcctgccac cgcccagccc agcaaacagt 
ctggggcggc aaggtgacag gtgtgcagcc 
gacatgcccc aaccacctga aacacctaat 
aacagtgggg ccgtccttgc tgccccttct 
taggctccca gagcattccc cactgtgtgc 
cactcccctt cacaggctca acaaggcaag 
ggtctcagca gagctccgag cctgcgctgg 
cccccagaac cgtgccttcc ctaatggatg 
aaaatgggat tgcaagtccc caaagcaaag 
gtagttgcta ttgatcgtgg gttgctgcgc 
tactaagact ttgtcagttc actcattgat 
cccattttac aaatgaggaa actgaggtgc 
agcctggccc cagagcctgc cctgcctgta 
gactgagtga taggggtgaa tctgaatctc 
cttttggagg aaggcaggag aaattgggca 
gttcccacca ccaagcagag cactgtgctt 
tttgtgtaaa aagtagggaa aggagaagaa 
caaagaatac ataaaaatta gtgaaagcgt 
ggtcagggtg gaaggaagac ttttccctgt 
ctggtttttt ggtttgtttg ttttgcacca 
ttttaaaaca aaataaaagc cagctgctcc 
ggtcttgctg tggccagcgc cctggtggac 
gagaagtcag gagccgtgag aaaccggaag 
atctgatgct ggggccaggg actctcccac 
atctgcaggg aagggcgtgg cggggaaatg 
ccctcaaagc atctgactca cctgcatgat 
cgtgttccag catctaacct tttacttttg 
cagggatgag ctgatggttg ctggaggagg 
tgacactcag gttcactgtg gaaaactgtt 
aaagatggag tgtttacaag tagacattcg 
aaaactagtc agatgaatta acttgttttc 
gtgctattta ttcttgcacg atttaggcaa 
ctagttgaga attaataatg gtccatctct 
ggctatttta aatgtcaagg tcagcagtgt 
gttttctata gtaacttgat taatttagtc 
tctctgcctg tccctctcct tctccatctc 
acacatacac acaacactaa gtgcctagac 
taagcctaag tttttacata attgcattcc 
ttggcaatct gctttttttc taaaatctga 
aggaacattt gatctagcag cagggatgag 
ctcctgtttt tattatcaaa aaggacactg 
tgtcacccta ggcctgataa gcgatcagag 
agcagaaaag agcactgaga gcacttggga 
gcagcctcct ctgaacttgt ggttcattct 
gggacagcct cccattgtca ggcagaagct 
ctctttcccc caggaggggc tcgacccacc 
gtgaggaggg cagcaaatgc tgacaaggat 
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ctatcccaca acccaccaat tttcattgtt 70020 

tagttgaaat gattgtatag atctaatttt 7008 0 

acatttttat agcctactac ataatatttt 70140 

atatgaatca tgattatata aacttctatt 70200 

gaaagtatct cgcttaatta cagagtgctt 7026 0 

caactttcag ttgagatgat tctggagtca 70320 

ctggtggaaa ggctgcccca ggatactggt 7 0380 

gcaccctcag acccagaaga tcctccagcc 7 044 0 

cttcacgagg gaggcctttc gacatggctc 70500 

gggattcctg tgagttgagc ccctaactcg 7 0560 

tctcgtctgt cactcaagtg cagtttgtgt 70620 

ggctggtgtg ccctgagggc agggtggcgg 7 0680 

ctgtagcctt tcctgctcca gcctttgcct 7 0740 

aaaggtctgc tcaatgcttg agacccaccc 70800 

gtaccccctg ttttaagtgg tgcccttttc 7086 0 

caggcactta cagctgtcta cccccacccc 70920 

taacacgctc cactaacaca actcctggca 70980 

ccatttccca cgtatattag gaatccagct 71040 

tcagattact ccgttatcaa tacaggaagg 71100 

tgggcggcgg ggcagtgatg agcataaagg 7116 0 

gccagatact aagactttat gcatgccagg 7122 0 

tcttcacact agccctgtac tactattatc 71280 

agggaggtta aagaaccttg cttattccaa 71340 

tcgtggtatg gaatggaagt gactaatgct 7140 0 

gtcataaagc gtgctgctat ttctcggctc 7146 0 

cttagcacac gtctacaatc tcatttatat 71520 

ggttgatgtg ctgaaaagca aactgcacct 71580 

tgtgtatttg tatttgcata agtataccct 71640 

ttacccagat ggaggggaag gcggagggag 71700 

gtatgtgtac ctttttgggg gttttttgtt 7176 0 

tgggattgaa aaataaatgt ttaaattatt 71820 

cagcatatgt tctctctgtg gttctcccaa 71880 

atttctcagc agatgccgat agtgtacaag 7194 0 

cagcagcccc ctgcacagcc tgggacctgc 72000 

gcacgagcta gtgagtggca caccagagcc 72060 

gctgtgcggt gcgggacgga agactggaaa 72120 

cacaagcttt ctttgacggt ttctcccatc 72180 

cataggaaat acttgattta attacaggtc 72240 

ccagtgtaga gccagtgaga gaactaggaa 723 00 

cttgggactg tctcaactgt gcaaaaaaca 72360 

tcatcagttg ttcttgaaca tggtctttta 72420 

atctgaagcc tgctatcttt tttaaaagat 724 80 

ttatctctcfc tccagggagt accttttttt 72540 

tttgatcata tcaagctagg atagaagggg 72 600 

tactttgaat gtaaactggt ataataggta 72 660 

ttaatccatt tgaaactctc tcttcctttc 72720 

accctccctc tctcacacat acacacacaa 72780 

tttaaataga tcfcagcaafct ggaaagttag 72 840 

tacattcttg taaaatttaa atagctacca 72 90 0 

tttgcagcca ggaaagaatt ttctcaccca 72 960 

aggaaagcag aaatgaatga actgtgaaag 73 020 

tcaagaaggc gccccctgcc cccacccccg 73 080 

gaaaggactc attcatgtca cgcttccttg 73140 

cccctggatc agagagcatc tgtgtgtcct 732 00 

caggctgggg tggactcaga tgccaggaaa 73260 

gcccaaagcc tggagaagga cttgtttgcc 7332 0 

caccctccct ctcagaccaa ggtggtggct 733 80 

gaaaagcaca tggaaaaaaa tggacgagga 7344 0 
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gggaaaactc tgccaaatgg aaaatgacca aatttaagag ggtgggacag tcccctgctc 73500 

ctctcccaga gggcactgct tggaaattgt gttttcccca tttatggtgc tctgtattct 73560 

ggcattatgc agcagcctcc cagaagctct cttctgcttc aaaacctggg atctctggca 73620 

ttaccctatt gggatggacc gctggacagc aatgctcgag tttgtgaatt tggagagata 73680 

ctcaaaagag ctaaaactgc agcattttac ctttaaatgc agtgcctaga gagagagtat 73740 

tgtctcttcc ccaacactaa ccccactccc atgaagaatt gcctggaaag atgttttcaa 73800 

ggaatttgaa ccataaaaca ctatctgatg cacagaacac ctctactttg agactcacct 7386 0 

ctcataaagc ttctttttca cattactgtt aaagaccaga cgttctagaa aagacccctc 73920 

ctctcatgag ctcccccatc cctgctacag aacacagcac ccatggcgcc tgcagtggac 7398 0 

tggcccctta attcccacag gcccccccag caaggccaaa gggaggcccc tgggtattgt 74 04 0 

cctcctacaa ggaagatcct ctttgtttgt tcaaaggacc agttttccta ggccaaagaa 74100 

gtctcttccc catgttagtc ctatgccttg aaatatcatg caccatgacc cacagccatc 7416 0 

tggttatgtc ttattttttt cctaaaagat aatgtttatt tttaaaaagg aaggaaggag 7422 0 

caagtgaagt ttcattctgc tccagcg'gtg gggaagccgc tgaatccacc tgcttctcct 7428 0 

ttgcaaccga cagcaaacag ctttctccgg cctcagggca gaaaaaggga atggcaggga 7434 0 

gtaagaggcg ctgggctcgg agcctgtttc caagaaggaa ttggttgtca tctggcagtg 744 0 0 

ttgcgcgtca caagagagcc tgtatataaa ttaaaatagt caagacaaca ctgaccttgc 744 6 0 

acttgtacat aactatacag tagtgtccag aatgttcaga cattcggagt gtacataaaa 7452 0 

cagaaaaaat cttcatgtat ttttattaaa tataacaatg tctgagtttc acctaagatg 74580 

tttttgtgcc atatgctgga tatccaggtt ctcgccaggc cccgatacat gaataacaaa 7464 0 

cccaagaaac gcatccccat tgtgtgatgt gttcagatgc atctggcacc aattaggtat 747 00 

ttcttaaaac aggactcatc tgtcagagtg cacatgaaaa atcaggcagg gaatcgaaac 74760 

gacagcgctg gaggagactc aggaagcaga ggcgtccctg ccgctgccct tggccctgca 74 82 0 

agcacatcat gaccctttct ggcagcctct tggtgctctg ggtagtgagg gatgaccagt 748 8 0 

cttgtcctga gaaatgtttc tcttagtctt taagttcaaa gactaacctg tagcaatcag 7494 0 

actttccaaa agggggttct ccattttttg tagttttgtc taaattttta atgaccattt 75 0 00 

cctggaatca gtttattata ctgaaaactg ggggtgggag tagggagcta gtttgttgat 7506 0 

aaatagttcc catttccccg tggagaattt gacataccct ggactcctgt gtgcctcctg 75120 

ccatccctgc acacagcctg gggagaagcc tgtgcctccc cgtgtggaga gaaggcaacc 7518 0 

ccagatcccc tgagctaacc cggaggaaag gcagtcctgg acagaagact gtcagcagaa 7524 0 

ggaaagtact ggactacccg tgggtaagtc ctgccattca agactggaga cacctgggaa 753 0 0 

ataaaaagag cagggcactg ctggtgggaa gaggcatttt accttccagt gcaaatcctg 75360 

ctcctttgat ttaatggggt gtactggggc caggggctga ttcacttcct tgggagatgg 7542 0 

tggtgttttc atgaacatct ttgatccttc catttcattt attcatccat ccattcaaca 754 8 0 

agtatttgct aaacactaac ttaagctaat gctagggtag tgactgagat gtaaaaatag 75540 

attttagaat taaaacaaaa tccaagtcct cacacccctg tcatcccagg agatctttcc 75600 

ttgtggtggt ttctgtgaga attggccatc ctgaggacac agccaggacg gcagaggcct 75660 

cctggcctca gggcatgccc tgcctacctt ctgaaatgtt taccccattg accaaacttg 75720 

gctccagcca ttgcggtggt ttctagatag ccaggcccac caagagatat tgccccttga 75780 

tgagagtcaa acaccctgcc tacaaggaga tgttttgaaa tggagaggaa aattggcacc 7584 0 

tcatctttta aaggcagtaa tggaattgat tttcagtaac tgaatttgtg cacaaaacat 75900 

tctaaacact agtgaagcct gtttcgttga actaattaat tctggctctg gaaatgtttt 75960 

tgttttatag ttatttacga tttcgtttgt ttggattcaa gcttagtttg ttaatatgta 76020 

taatttagca tctattacac tcatgtaaat atggagtaag tattgtaaac tatttcattg 76080 

cggggattgt gggtgttata catacattta ggactgcaat tttttggtat ttttttgtat 76140 

tgtaaaataa cagctaattt aagcaggaac aagagaacta agggaggtct gtgcatttta 7620 0 

aacacaaatg tgaagaactt gtatataaac aaaagtaaat actataatac aaacttcctt 7626 0 

ctgaaataaa agtagatctg gtaaaaatgt ggcttttgtt ctgagtgttt cattttgatt 7632 0 

ttgcattgtt ttgcctatat ttacattttg gtcattagaa ccttaaggga aaaaaaaaag 763 80 

cgtttaccag ttttcagact gcgtcagagg gagcacttga cagttaacgg aggtgagggt 76440 

gaaaacccct ggcaggtacc tcagacagcc ctccaggcag cttgtgagcc acagaccttc 765 00 

cacacagcct ttgtgtgcct acagggcagc tgggcccgtg ggggcaggga cagatctgag 76560 

cagtgaggta tggacagccg tctcactgct ccactaacca caccactgtc tccagtggtg 7662 0 

ctgttatccc tccactttgc gctctcctca caggctcctg aatgtcctct aggttgtctc 7668 0 

caaactgcta aatctcaggt gtgctgacag gtctcagccg tggccagtgt tttcctcggc 76740 

cttgaagttt tgaggaagtt actcccattg gcaaggcctt gtttgggggc ctcctgtgtg 768 00 

tcctgccccc ttttatttgg ctcctgccct cagaatgggt cagagtcatg tatttcagtg 76860 

cagagtgcag attcccagtg ctagcagttc cttccagaag gatgaggtca ttgaaagaag 76920 
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gaggaaacac aagacggtca ctcctgtagg ggaagatggc agggtactag ggccagggcc 769 8 0 

tgaggtgaag gcaggagcct ggaggaggcc tgcctaccag agctcaggtc tggcttaggg 77040 

agggaaatgg ctggactagt ggctgcaaac aagttgagaa ggaccttgaa actcttccag 77100 

ggagggttat gagagccacc aacagtaaca ggatagaaac cctgtgattt ggtggtactg 7716 0 

tggcagagac aatgctgtgt ttaccaaacc tactgggccc acagctcatc atttcccagc 7722 0 

ctccctgcag ttaggttggg accaagctaa gccatggcca aggaaaggtt ggaagaagtg 772 8 0- 

atgggcacca attccaaacg tggcacacaa atgctcccac aggggactct cctacggatc 7734 0 

acatgttaaa gactggaggg actgtgggtc cctgagtcac cccttagagg ccggccgcct 774 00 

gatcagtaag agctgtgctg gcctttgcat cactggaaag caaactcact ctgctgatcc 77460 

actgagattt ggggctttgt tacactagtt agcttttctt attctcacta atatacaaat 7752 0 

cagcactttg aagtgcagtg cacatgtaac caaaccctta agtacatggc actggctcag 775 8 0 

cgtgtggcca ggtggcgagc agtgataaga caggtgcagc tggcaggaca actgaggtcc 7764 0 

ctgccaggcg gagcaaaaca tctggtaagg ccgtttcact cgataactcg gaaaacaaac 77700 

tccatgtctg ctgacttttt gacttgaggg gaataaacaa caacaacaac aacaaatatg 7776 0 

tgttgattgt tactggctgc ttttggcgag gcatcacgag aaaaagttta cttcaagaaa 77 82 0 

gaattagcca gtttacaagc agaagggaaa aagatggaaa gatgttaacc agggccttgc 77880 

aattctggaa aagccaactg cttctagtct ccaaccagta agaggtgcag gtgagcaaca 77940 

aaggccccat gagacttagt tgaacacagt gactgggcct catggtgacc atccaatgaa 780 0 0 

gggtgtggtc ttcccactgg agcatggaaa ctgcaaagcc accagcttga gggaggggca 78060 

cgtggttaag gaagcaaaga aaaagctgac atgagaactg tttccaggaa caaaccatga 7812 0 

atgtagtcac tgaccctgga cctgcctgga aataaataaa gatgtagtaa atttaagaaa 7818 0 

ccattagcaa agccttaaaa aaaaaaaaaa aaaaaaaaaa aaacggtggc aattgaggaa 7824 0 

gtcttaaatc caccttcagg caggaaaatg gttgtgaaag ctgtgtccag cgagggcttt 783 00 

acaagctcag atgtgggccc aagagcagaa ctgggcctga gcgaggactc cccccagccc 783 60 

aggacagggc ccagcaggat tgcaaaaaca ccgcagagca gcaattgctg catctgcacc 7842 0 

cgctccttcc tgggtaggag ggcctaggag ggtgtcctgc ctgttctcat cactacctgc 784 80 

tgggtgggag gtggattaag ctgtcttgtt aattcaggat tgtggacttc aaagagccac 78540 

ctgagggagg ctgcatcacc tggaggtctg ggcggggtgt gacctcgtat tgtcccccta 78600 

ggagagggtg cctgtgaccc atgtaggaga ggaagcacaa aactggtttt ggtgatcaga 78 6 60 

atagcaaact gggccagatg cttaccacac atttcctctt cctgagtcca cagcttcgac 7872 0 

tgcatggcct gggcccgtgg catctaccca gatgccttgg gaccagccct caccaatgat 787 8 0 

gtgtcatttc aaggccattc acagtgacag tgtctcctcc actgtctcca tcaaccttga 78840 

aggcttatgg tgaagatggt aatacaacag gattgaagga gactgggtcc ctgaatgact 78900 

ttgtggaaca gagcactata ggccaaaggt agtagaatta ggtatcttgt tttatgctgc 78960 

ttaagtgaat taattaattc tctagattta ttcacctagt tattcaacaa atttttattg 79 020 

agcacatcag ggctagggca ttgtgtcagg atcagagatt ttgcaatgaa caaaacagaa 790 8 0 

aactccatgt cctccagggg agttgataat aaaatgtcag ataagctctg tgaagaaagg 7914 0 

taaaagtagg gttaggggta gaaagtgctg gggggattgc tacttaacat agtgtggcca 792 0 0 

ggggaagcca ttctgctaag gtgacagctg agcagaacga tgagtggagg gggcaggaag 79260 

gaatgtggag gtctgggaga agagcattcc aaacagggat ggcaggtgca gaggccctga 79320 

ggggg ca gca tgcctggtag gcctgctgga tatttctgag aatgaaaaag tccattccca 79380 

gttgtttgcg tggagaacct aacccagtta taaacacaaa gtacttgaga gctggtttaa 79440 

tatacagcca gctttccagc agtgcaacta ctgtgtacat caagggaaaa ctgaacttcg 795 00 

ttttccttaa aacttatcat cagctggtca tcattttgac aaattctgtc aacaacagca 79560 

gtgtcattcc tggcatctgt atgggtcacg tctgaacaga cacacgccct gcagccctgc 79620 

aggtaccagc tgtataacaa gaactccctt ccaccctgtg tcctggaaac aagaaagcca 79680 

ttagaccgga agatcccgat ggctatctca aatgtgctgg atggagttgc cagggcccac 79740 

tggcatgccc tgtaagcctt tccttccacg tttggttcct gccccttgaa gactccattt 79 8 00 

ctgagtttgt gtgtgtttta ctttctagtg tgtgtcctca tcttaatttt tctctctctc 79860 

ttctgccttg aactgaaggt tcgcttgggt gtggagagac aggcccccag cagagcagct 79920 

tcccgagaca tcctccgatc cagggcttcc cagcagcccg gcaaggcagg gctgtgcctt 79980 

tctgcttcag ctcacaagca tgccaggctc actggcaagc tgctgtctgg ttgagggact 80040 

gctcctaaag ccctgcacag cccctgtcct cctggccctc tggaaattcc acccccgtgt 80100 

ccacatttca tgcaaaaatg agctggttct gtgagcatgg cccggcctga ctcgcttagt 80160 

gggcggtaag tggtttccac ttcaaccttg cacctaatca ccgggctcca caccaggatg 8022 0 

gacattcatg agccgtgaag tttccagtaa taaatccaca gatgcttcca gcacctgcct 80280 

tttcgcatca cctccactcc cagccacctg ccaggcaaca ggtaacagag acccagtcac 80340 

aggagggcag tgtgggggca ggactgcagt ctcccaaagc ccatgcacaa aaccgacagc 80400 
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gccctggcag gacaaggagg ctgacattca gatgtggagg aacaaggcat gacccattcc 80460 

tggtcatggg ggccacagct ggactcagcc ttgaggcttg gccagactta acaccgtgta 80520 

taaaccagga cctttttagg tagagtaatg gaaaccaaac tctaatgatc ttagacagtg 8058 0 

ctattagtct cctggagctg ' ccagaacaaa ttaccacaac ttcagtgtct tgaaacaaga 8 0 640 

gaaactgatt ctcacagttc tggagaccag aagtctgaaa tgcaggtgtt gccagggctg 80700 

tagtctctgg agactctagg ggaatctgtg cctacctcct ccagcttcca gtggctcctg 80760 

acattccttg gcttgtggct gcatcacccc aatccctatc tctgtcttcc cctggtcttt 80820 

tgctcaaaat gtctgtgttt agtttccctg tagacacctc tgcatcactc tcataagatg 80 880 

cagaggtgcg acatacaggt gttgagagcc cacttagata atccaggata agctcctctc 80940 

aagatctgta acttggctgg gtgcagtgtc tcacacctgt aatcccagca ctttgggagg 81000 

ccaaggcggg aggttcactt gaggtcagga gttggagaac agcctgggca acatggggag 81060 

accctgtctc tactaaaaat agccaggcgt ggtggcacac acctgtggtc ccagctactc 81120 

aggaggctga ggtaggagga ttgcttgagc ctgggagttt gaggctgcag tgggctatga 81180 

ctgcaccact gaattccagc ttgggtgaca gagtgagact gtctcaaaaa aaaaaaaaac 8124 0 

ataaaacata acttaaatca catctcttgc cacagaaagt aatactcttt tgcctacata 813 00 

taaggtaata tttacaggat ccaggggtta ggatgtggac atatctttgg gaccactgac 81360 

agccatgaag caatctcata attttcaaat aggttctgtc ctttttatct ttccagtctt 81420 

ttggaaagca tatgcctata ttttcaatcc acaattctat ttttatttga ggtcatttca 814 80 

tttctggttt ttatttttta ttgagacagg gtctcactct gtcacaggct ggaatacact 8154 0 

agcacaatca tggctcactg cagccaactt ctgggctgaa gtgatcctcc agcctcagcc 81600 

tcctgagtag ctggaactac agacacacat caccatgcct ggctgattca ttttttaatt 8166 0 

ttttctagag acaggctcta tgttgcccag gctggtctca aactcctggc cccatgcaaa 81720 

cctcccgctt cggcctccca atgtgctggg attataggag taagccgcct tacccagcct 81780 

ccagttttat tgtgttttgt tttgttttgt ttgagacaga gtcatgctct gtcaccaggc 81840 

tggagtgcag tggcacgatc tcggctcact gcaacctctg ccttgcgggt tcaagcgatt 81900 

ctcctgcctc agcctcccga gtagctcgga ttataggcat gcaccaccac gcctggctaa 81960 

atttttgtat ttttagtaga gaccgggttt caccatgttg gtgaacacaa agtatttgag 82 020 

agctggttta atatacagcc agctttccag aaatgcaact actgtgttca tcaagggaaa 82 0 80 

actgaacttc gttttcctta aaacttatca ccagctggtc atcattttga caaattctgt 8214 0 

caacaacagc catgtcattc ctggtatctg tattggtcac atctgaacga cacacgccct 82200 

gcatgcagcc ctgcaggtac tggctgtata acacgcctgg caagaattcc cttccaccct 82260 

gtgtcccgga aacaagaaag tgattgtgat ccactcacct cagcctccca aagtgctggg 82 320 

attacaggcg tgagccactg cactcggctt cctcccgttt tttttttttt tcaatgctta 82 3 80 

tattttactc taattaactg agtcaaaaat tgagaatagt tgaatacact ttcatgtaag 82440 

gcgaatcatt tagccgatac ttaactctgc atttgggcta ccatgccgct gtggttagcg 82 500 

gggcaaggtg atgagccctg tctcaacaca cacaccccgc ctctccccag cccacttaca 82 560 

cgcgtccatt cccacgcagg tgtgggggcc ttagaggatt ccctcttctt cgtaaagtga 82 620 

gaatgggctg gactcggctt cactgcccaa caactccttt ttttcctttg ggaaactgtc 82 680 

cttcccattc catataacct tcatgggctg agatcactca ccgagctaca ggaagaggcc 82 74 0 

cattactatg gccccatcag ggttctgcct gggacagtcc ataaatgctg gaagagagag 82 8 00 

gtgcctttcc ctactgaggc tgctaaaagg agacactgca aactggggct gctggtggca 82 860 

atcttacact ctcagtgaaa gcctgcctgg ctgcagggga aaccaatgca cagaccagca 82 92 0 

ggggcaacat catttgaacc cctggagaca gctgtgcctg aagcccacat ggctcaggca 82 980 

catgctcaag ccactctgat ttgcttgtta cagtcaagag agtccttggc cgggcacagt 83 040 

ggctcatgcc tgtaatccca gcactttggg aggccgaggc aggcggatca cctgaggtca 83100 

ggagttcaag accagcctga ccaacatggt gaaaccccgt ttctactaaa aatacaaaaa 83160 

ttagccgggc atggtggcat gtccctgtaa tcccagttgt tagggaggct gaggtgggag 8322 0 

aatcgcttga acctgggagg tggaggttgc agtgagccaa gattgcacca ctgcactcca 832 8 0 

ggctagctaa caaagcgagg ctctgactca aaaaaagaga gtcctgactg aagctgagag 8 3 340 

gctggtggac agctgtcagc aagcagtgtt tgtgagtctg atgtgagctg cgaagtccac 83400 

acctgatttc agagctggtg gctctttttc caatcaggac agctccaggc ctgagttttg 83460 

gtgtggtctg tacctaaccg ctgtgtctta ggtcaggatc tctagaagcc aagcccaaga 83520 

caggagttct caaatgccgt gatttactga gggaatgcct tcgggggaac ctgccagtga 83 58 0 

ggagagctgg aggaggcggg caggggcggg agctgagcca ggtgggggtc cctgctggag 83 64 0 

tctagcctca gcttgacccc acaggggctc tggagcagga acagcacact gcattgtccc 837 00 

ttgaggcagt cactggctgc agttgcctct gggagcagag taaaagtgga caggcatttc 83 760 

tgggagtaca attccctaga gaaggggaca gctctgagct gtgttaccag ccaccatttc 83 82 0 

caggggctgg gggatgcgct gcactggccc agagaggtct ctgagcaagg cccccacaac 83 880 
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ctccacttca acctcctgcc aaacctcaag cagattgaga aggagccgct agagacacag 83940 

gaaccctttc ctcatcagaa gtctccaagc tttggaggaa gaaggaggca ggcaggaggc 84 000 

gggaggatcg tttcacccca ggggttcagg gctgcagtgg actatgatca tgccactgca 84 060 

ctactgcctg gtgacagagt gagaccctgt ctcaaaaaca aacagacaaa aaaaatcaat 84120 

gctcttagca tctgctgggt ccacattggg cagtgtgtgt gacgtggtcc agctgcaagg 8418 0 

gaggctggga aatgcagttt cattatctga gtactagtgg agaaagaaga gacgaaagca 8424 0 

atgagaatcc ctgccatgag gaagtgagag ctgattcatt tcagcttcct aaggtgaggt 84300 

gatggggccc agacaggttc tgaggccact ggaggtcaca caggccagca gggatacacc 84360 

cagtggtaga ccctggtctc caggctccca ggacacagac cacataaaag caaggtcagt 8442 0 

ctcagcacac agatttaata agcactgtat gttatgccta cagctgcagg aaaccagggg 844 80 

gaagatggcc acagggccaa tgccaggcag ggcacagggt ggggcccctc aagcagcata 84 54 0 

tggggaacag gaaagactct gccagcatgg tcatggtgca caggtggcta caatggtggg 84 600 

taatgcccag ttgggaggat gcagtgttag acggggtctt gtccgatgga ttccttaggg 84 66 0 

ggttatatgc ttgggaggct gatacaattt cctagaattc tgggattctg gatggtctct 8472 0 

gacctgctct aggggatctg ggtctccaga gaggtctcag ctacaaaagt gaccctctcc 84 78 0 

tgaccccacc ttgtgggaga ccccagtggt ctcaaggtct gtgaataaag ggacttacaa 84 840 

ccaagatgcc tcctgacagg gagacggggg ctttacagtc aggccccagc ctacatccct 84 9 00 

tgccccaggg gacggcacac agggtgtcca aacctacaaa gggcagcgga gtgaaagcac 84 9 60 

cagggccggc ttggggggta agccagcccc ttctcaaggg gccagtaggc cacagaggcg 8502 0 

gcccaggcag gcagggtgga gtagctggct gtgctatccg aggtcgctgt cctaatcaga 85 080 

gc agggc c gg gagagc cagg ac aggaagt a t ggagagc ag ggagcg t c t c c c aggg cc c 85140 

tgcctgtgga ggacaccttg ggggtgggag ccaagtgcag acttgaggca aaagcctgtc 85200 

tctgccctct gcctgcaccc ccaccccagg cccttcatgc ctccacccag gcccaccagg- 85260 

gacggaaggg tgtgaaggtg acagtggggg aggtgccagc tcaccctccc ctccctgcac 8532 0 

ctacctgcag gaggctgtgc tccgggtctc cctgggggtt cagctggtcc agcaggactg 853 80 

ggggccaccc gctggcaaag gcctgaatgg caccatctat ggagggaggg gtccagggtg 85440 

gacttagctg ggaaacccta gcacagggct cagcacaggg gacaaagtga gagaccccct 855 00 

gctgcaaaga ctggctcctt ggacagaagg tgtgtggtga ggaataacaa aataataagc 855 60 

acagctgtga cagagtgact caactgtaca gtcttccatt ctcccctcta aaatggctcc 85 620 

cagccaggcg gcttgggagg gggcagtaag cgccgccccc actccccctc cac tcccccc 8568 0 

cgggcccctc acccaagcag cggttcctgg taaagagccc ccggaaggct tcgcagtcct 85740 

cacgccggtt cccgctggct ccgcagtcgc accagggcgc cacgcgcgcg ctcacgttgt 858 0 0 

ccacgtagtt aggggtgacg gcggtgcctg cggggaccct gagggcgaag tcatcggccc 85860 

acccgggcgg agatatcccc tggagaaccc ccgccctcgc ccggatcccg gccgcgcgta 85 92 0 

cccacgaggc ccgcgtaggc gcgcaggcag cgggcgccct ggtccagcag gcagccgtcg 85 9 80 

ggggcgctgg gcgctggggt gcacgagacc tgaaaggcca ggaggcgagg cctgcgcggc 86040 

gggagggcgg tgagccggag agaggccccg tccccaccct cgccacggcc ccgccgccgc 8610 0 

ccgcgcgcac ctgcagaccc ggctgcgctc gcagaagttt aagggctcaa ggcaggaggg 86160 

cggcgcgggg ccgggccccg aaaaggcgca ggagggcacg aaggtctggc gccgacgctc 8622 0 

ggcgcacgcg gggcccgcgc acgggcagaa gagcagtgcg tgggtgagcg cgggcggccc 86280 

gcgggcgaag aagcggcgca gggcccggcg gcagcgggcg cggggacagc ccccctgcgc 86340 

agcccggccc aggcactgcg ccacatactc ggagcgcaaa cgctggcacc gcgcgtccgc 864 0 0 

cgtgcaggct tcggccgcgt ccacacatcg gttccctccg accgagctcg ccgaccctgg 86460 

gaaaggcgcg gggaggctgc aggtctcagt gcgccccgca tgcacacttc acccagccca 8652 0 

tccacgcccc tggaaggtgg gggacactga aaacccgaga aagcatagag tgtggcccaa 86580 

aggcggggcc gactggcact gatcggctct ccgtcgtggg gatgggcagc ggcgggctct 86640 

ggagggctct gcaacacgtc agggatggag agggagcatg gaggctgagc caggccttcc 86700 

ccaaaagctg aagcccctcc ttcccaagca tcgccaccat tcccagaacg gccacacacg 86760 

tccaaagacg ggtccagggg ctcctcctac actcaatggc actacctgac gggggaacaa 86820 

tgaggctttc tggggatctg gccaatggag actttaaaac cagagatggc atagaagaca 868 80 

gtggctgctc ctagtctctt aatgacaggc tgtgacgcca tacctttcct cctctacccc 86940 

ctccacctcg ccgcttatcc tccaccatgg agaccctttg ggtgggtcta ttgatttctg 87000 

tttctccctc cttcctgacc tcctgctcct cctcctgccc cctgcatggg tgtctcccct 87060 

ctccagccct tcctgatgct cccttgagtc gaggtcatcc tgccccatct gcactggctt 87120 

cctccagccc ctccaccctt catcagccag gggagagcgg atttatgcaa gagctctggc 87180 

cactgaagaa cccactgagt cctggctgcc agggtgtgga ttcagagaag tccctgacga 8 7240 

cacaccgccg ggctttgt cc tgagccacag gacccaaggg ttcatcagtg gttttagtga 87300 

ccaggcccca tgataggtga caccctgctg gagacagaga aggtgtctgg caccccagtc 873 60 
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acacaccagg ggttgggtag acagagggtg agctctgccc acatgcccac aggctgtgtc 8742 0 

taaatagagc caggataact ggaagttggg acttggcagc cccttcccag gatttgagtc 87480 

acaaaaacca ggctaatcga atccccccca gcctccacaa gttctccagt ctggagcatc 87540 

tcattcagac tagaagcctt tctggtgccc agcctgagtg gacgtgcaca gcctggcccc 87600 

tcatccatct ccctgtgtgt ttgtgcacag gggctacggg gtggtatccc cccccagggc 87660 

tgggagcagg gtgcctctct ggcaatggct gtgtctttca ggctgtggat tggaccgcgt 87720 

ggtgagagtg attcctgctg tgacttcagc acctgcctct gcaggaagga tgaacatagc 8778 0 

tcgagggctt agaatggtcc ctgagtggag gggatagatg gcagcggggt gagtcctccc 8784 0 

ttcctcccct ccccaaaggg cacactttgc tttcctctca cagccttgcc aggcacatcc 8790 0 

tcaggtctca gctgcacaca ccagcacctg atgttgccta agggtttgaa cccacctctg 87960 

cctgttgctt acctgtcacc cttgcccaga acacccttac ccagtctcca ccctcacagc 8802 0 

agctcatctc agctgcaaga tctcctccac cttaaacctt taccagattc cactttccac 8808 0 

actacactcg gtaaagtgac ccctccctcc gcgacatcca tgcagggacg gagcagtcca 8814 0 

agaaggtgga acaacagatg aacaacaccg aagattcacc aaggaatctg tgtgatgggt 88200 

aggagggctg gggctgaggg gagggtgaga gagttgggag atggcagaca ataggcaaaa 8826 0 

gttcctgggg cacgggtgcc aaggctggac tcgggagggg gatggcggtg gtggcttgac 8832 0 

ggggctggga ggtgaggtgc tcagcacata gtggggtcaa ttgccagggg tccccaagcc 883 8 0 

gtgcgcacag gcctctcact ggcaaccctt cctgtggttc agcttttcct ggaatatctt 8844 0 

gccctcccac cttgtcactg aggagcaggg acagagaggc cagggttcgg ggtccagggg 8850 0 

aagagagggg cgctcaccca gtaacagcag cagcagcagc gcaggcccca ggcagcggac 8856 0 

catgctggac cttcaacaga gaaggatggc tggcagagtc ctagtctgat aggcgccccc 8862 0 

ttcctagtgc ctctggagca ggagcacagt gaaaacccag ctgtctgcgc tgatacccct 88680 

gggaggagcc tttgctgcag caactccccg ccctcattaa gtctccaccc caggctgcac 88740 

cgtcaggtaa acctgggagc cacgggattc ggcgcctgaa tgtgcctaga cggaccccaa 88800 

cacactagcc ctgccacgca cacccagcaa cacacagaca accacccagg gagacctggt 8886 0 

ctctcccatg ctaaactagg aacgctatga aacagcattt ttcttttcct tttttatata 8892 0 

attctatttt aaacttaaaa aaattttttt gaatagagtt ggggtctcgc tctattgcgc 88980 

aggctggtct tgaactcctg ggctcaaacc atcctcccac ctcggcctct caaagtgcta 89040 

ggatgacagg tgtgagccac catgcctggc ctattttaaa cttcttattg ttgactaatt 89100 

ttagacttgc agaaaagttg gaaaatgtaa gcattttcct tacacccttc acccagcttc 89160 

ccctgatggt ccatcttact taatcatagt ccaatcatta cagctaggaa actaaccttg 8922 0 

gtacaatcct gttaacgaaa ctgtagactg ttttgcattt ctcatttttc cactaatgtc 8928 0 

ctttttctgt tccaagatcc attctaggat cccacatcac agttgtctcc ttattctcct 8934 0 

caatctggga gttccttcat ctttccttat ctttaccctt gacacttttg aagaatcctg 894 0 0 

gccagttatt ttgcagaatg tttctcttga gttgtctgat gttttattct gatcagaatg 89460 

agacacagca ttgttttgac taaccaaaaa gttattctat aagtaaatat tgaggttaaa 8952 0 

aatctcatcc aacctgggca acagagtaag gccctgtctc aaataaagtc tcaacactaa 895 8 0 

gatttaaaaa gtgaccagaa aagcccccac tatgatttgt cttgactttt ttttaaaaaa 8964 0 

caaacaaaca aacaaacaaa caaacaaaaa cgatgtcttg ctctctctct ccctcaggct 89700 

ggagtgcagt ggcacgatct tggctcactg caacctccgc ctcccaggtt caagcgattc 8976 0 

ttctgcctca gcctcccagg tagctgggat tacagacgcc cgccaccatg cccggctaat 8982 0 

ttttgtattt ttagtagaga caaggtttca ccatgttggc caggctggtc tcgaacttct 898 80 

aacctcaagt gacccgccca cctctgcctc ccaaagtgct ggaattacag acgtgagcaa 8994 0 

ctgcgcttgg cctcattagc atcttaaatc tccacacagg ggtgtgttcc ttactgttat 90000 

aaggagcaaa ggatcagttt gaggacaggt aaaataaaaa tgcgcttgct gcctagaggg 9006 0 

agaagtccct gctgaagata gctttgcttg aatgagctca attgcaatgc cagtgctgag 9012 0 

gcttgttgac tgtacggtca ccacagttgc tgctgcgcgc ctagaacatg gtcactttct 90180 

tgactaccta tcctgtctca gtacatctgt ctgtggtttg tggtggtcca tttcctaatt 9024 0 

tttttaatga atcagaagac tgtgatgtgc tttccgctgt gctaaccatg gccgctgaag 903 0 0 

caaaatgtaa accaagatgc ccctgcagtg gttgtgcttc actctacgac atctgttacc 903 6 0 

ggaaaggggt ccagattcag accccaggag agggttcttg gatctcgtgc aagaaagaat 90420 

ttgagacgag tccataaagt gaaagcacat ttattaagaa agtaaaagaa taaajagaatg 9048 0 

gctactccat agagagcgca gccctgaggg ctgctggttg cccattttta tggttgtttc 90540 

tggatgatct gctaaacagg ggtggattgt tcatgtctcc cctttttaga ccatatatgg 9060 0 

taacttcctg atactgccat ggcatctgta acctgtcatg gtgctggtgg gagtgtagca 90660 

gtggggaccg accagaggtc actctcatca ccatcttggt ttgggtgggt tttagccagc 9072 0 

ttctatattg caagctgatt tttttggttg gtttggtttt tgagacggag tctcgctcca 90780 

ggctggagtg cagtgacatg atctcagctc actgcaacct ccacctcctg ggttcaagcg 9084 0 
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attctcctgc ctcagcctcc caagtagctg ggattacagg cacacaccac catgcctggc 90900 

tagtttttgt atttttagta gagatgaggt ttcaccttgt tggccaggct ggtctcgaac 9096 0 

ttctgacctc aggtgatccg cccacctcag cctcccaaag tgctgggatt acaggcgtga 9102 0 

gctaccgcgc ctggcctact gcaaactatt ttatcagcaa ggtctttatg acctgtatct 910 8 0 

cctatctcat cctgtgacgc agaatgctgt aactgtctgg aaacgcagcc cagtaggtct 9114 0 

cagccttatt ttgctcagcc cctattcagg atggagttgc tctggttcac aggcctctga 912 0 0 

cacatcctct tgtgttttgg cgtgtgggag gaaagagggg tgagggaagg aaactcaaaa 9126 0 

ccaagctctg accacacagg gcaggtacac tctcccacct gtctgtgggt gccacaagtc 9132 0 

aagggagggg cagagagaga agaaggtgtg acagatggcc gcaggccaca gaatgtcaga 913 8 0 

ggaagcccag ttcctcccgg ggcagcccaa gtagctggta gttgggtggc caaacagagg 9144 0 

gcgtcacagc tgagctgggc tcgctcgcta cccccagctc agcgtccact ctgcccctca 915 0 0 

gtacctcctg ctcagcctca gggtccatgc ctaccctcct gcttcccagt cacttctgcg 915 6 0 

tgcctcctgc ttttctgctg ttggccccat gccagctcct ttctgctgag cttctcttct 9162 0 

ccagttctgc agcacagcca ggtgatcctg ggctccagac aggcctctcc cccagtctgg 916 8 0 

ggcctcccct cttagagccc tctttccttc ccacgtggcc tccccagggt tcgccactga 9174 0 

atggagaagg ggtggagggg gtgctgggca gtcttgggga ttagccaaga gggcagagtt 9180 0 

ggcctcccca gggtcccttg tagctggagt cccgcggggt ctagacaccc cctcctgaag 918 6 0 

ggtaagagcg ggggaggtat attaacgtgt atttttagag tctctctttt tttttttttt 9192 0 

ttttttgaga cggagtcttg ctcttgttgc ccaggctgga gtgcagtggc acgatctcag 919 8 0 

ctcactgcaa cctccacctc ccatgttcaa acaattctcc tgccgcagcc tcccgagtag 9204 0 

ctgggattat aggcacgtgc caccacaccg ggctaatttt tgtattttta gtagagatgg 9210 0 

ggtttcacca cattggccag gctggtctcg aacttctgac ctcaaatgat cctcccgcct 9216 0 

tggcctccca aagtgctggg attacaggcg tgagccacca tgcctggcct tagattctct 9222 0 

attttatgat ggtttaacat ctcggggtgg gggcttgttg gctggagaga aactgcttga 922 8 0 

ttcctggaga tcagaaacaa ctcatgcctt tcatatgcaa accgaccagt cttgagtcca 92340 

tacaccaacc acccccttca aggaactctc acatacgaaa ccagtatttc ccctgcccta 924 0 0 

aaccagctca gggccaggca cctacccaga caattagagc ccaaccccac gccccaaacc 924 6 0 

ccaccagaat gattcaaaat gccaatccta ccctcttccc ctggcctgcc ttgcctcccc 9252 0 

agtggaaacg gcactgtggg ctgtggcctg tgccttccac tcgctcctga ttctgtccct 9258 0 

ggaccaaacc tagtgcctcc ccactgtggc cctgcatggt gggaaactgt gagtaataac 9264 0 

ttatttcaac agcattggcc tctgtgtcat cagtcacctt cataaattaa aattttgcaa 92700 

ctacaactga ggcaggaggg ggctttagag agcactctca cctggcccct tgggcttgtg 92760 

gagtggagcc cgaggtgaag ttgcctccct gcactcagtt ttgggatggt tttgttatgc 9282 0 

ggcaacagct ggctgatata ggccactcag cagctcttgc atgaagcaat gggagaatgt 9288 0 

gaacgcccaa gggaggcagg agtgacagag caaagagggt gttcaaacta ggcaacaccg 92940 

tcctgtgccc agacacatgc ctggggctct ggctgccatc taggctcggc ctgccagcca 93000 

ccaccccgtc agccaccacc ccaccaacca ccaccccgcc aaccaccacc ccgtgtggtc 93 060 

ctcagggcac cccatgggtt gaatttacat acagaaaaga ctaaccaagg tccaagtgta 93120 

atatgtcttt ttaaaattta ttttagagac agggtcttgc tctgtcaccc aggctggagt 9318 0 

gcagtggtgc caccatagct cactgcagtg ttgaactcag gctcaagtga tcctcctgct 9324 0 

tcagcctcct gagcagctag gcctacaggt acacaccacc acgcccagct aattttaaaa 933 00 

tttttctgta gatgcggtgt cttgctgtgt tgcccaggct ggtgtggacc tcctggcctc 93 3 60 

tggtgatcct cctaccccag cttcccaaag tgctgggatt ataggcatga gccaccacgc 93420 

ctatagccaa catgtctttc ttttgacttc tactttggta tcttttctta aatggttccc 934 80 

tctgtccccc cgacacacac agaatggggg agaggctgtc agattctgag ctccagaacc 93 54 0 
tcaggtgtag cactgggatt gggggtgggg gctcaggaac cacctagggg agaagacagg - 93 6 00 

gtgggaagaa acaggaagga aggtccccaa aattatgttt gtttgcagag gccagccagg 93 660 

ctccagggga gtgtggactc agtcgaacca tagggcccca ggaccactag cttctggcca 93 720 

gcagtcatgc cctccacaga gctgggtccg tggaaattgc atgtaggaga cacaccagac 93 780 

tcccaggaca gagccctttt gggatggcca gcactaccca gcctccactg gtgagggagg 93 84 0 

tcaggggctg tgtgaccttt gcttctggga ctgatggttt attgagctgg agagtgtgcc 93 9 00 

cagcagtgtt ctccagccct caggaacttc tagtgtggct ctgggttcct ggagtgggtg 939 60 

ggtcgaagct ccactcgggg aagaaacttc caagctgcct gcaggtgctg gaggtccggt 94 020 

gattcactgg ctctgcccct gcagttcaag ttcctggagt ggctgtcagt ggccacctgt 94 080 

ctttaaatct gttcatttta ggagctacct ctcaccagag gcaggatctt ggcatctgga 9414 0 

cttgatctgc tgagaatgag gaggatatgt tgtcccctaa ggactggggc cccaggctgc 942 0 0 

aagctgtgtg gcagagagcc catcctcact cagtgaggac cagtgatcca ggaaaagcca 942 6 0 

cagcttctcc ctccccagcc caggggcttc cagcatcctg gtctccatga taaccaagag 9432 0 
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gtcataaact catttccata ataacctgag cccagaaacc tgattagggg gcagcaaact 94380 

gaggggtggg agaggtggga gggtgggcga tgagaggggg aggctttgaa tccaggtccc 9444 0 

tgcctacctt gggggtcagg cgagactgct ggcagaggct tctcagggtg gctgctgggc 94500 

tcatgagagt tctcagggtc tgggagaaat ggtggagggt aaatgttgtg aatatggtca 9456 0 

gcaggagacc ctggggctgg ggaggggcat aggggactca aggtgactgg gtgctgccca 94620 

tctggaagga ggcaggaggc atgagccctt cccttctccc ttccctctcc acctccccct 9468 0 

ggtgcctcac tcacccaggg gccagggctg tccagtggct gtggggccca actccatggg 9474 0 

gtgaacgccg cccagggggt ggtccctgtg tgggccatct ttggggctga gcaacgtgat 94800 

aagagtccag gaggttggca cagtgatcct gagtgggtta ttgcctcccc gcagcatggt 9486 0 

gtccagccca gggagttctg cgtttactga gtttcttggg gcacccatct gctccaagtc 94920 

accctctcag ctcccttcct gctccctctt caggggagcc ttgggatcca ggctccaagt 9498 0 

gagcctcatg ccctcggctg gcacctcctc tctctagtcc taacatttcc tccaggctct 9504 0 

gacaccaccc agcagcctgg cactctccag atgctggcat cgctcagctt ccaaagaacc 9510 0 

ttggatgtcc gccccttcgg cagctatgtc tgctctcctt gcccctgggt gccctgctgc 95160 

ccttgatgat tccaagccat ctttgactgt ccccatccca tcccccaagg ccttgtcatt 9522 0 

tcctgtgatg ttccttcaaa acattctccc ctgccctgag actcccgcct ggggatgaga 9528 0 

agcagccgcc accctctgca gcgccccctc cgtgctgaca cgccaggctc tggccacctt 953 4 0 

gctcctctgc ccacagaccc tcaatcacaa ctcgctttgt cagggtcctc ttagctgcca 95400 

cccggggccc aggtggtgcc ctgcccctgt ctgttatgcc ctctgccccc atctctggcc 954 60 

caaatcatgc catctccctt ggcttgcccg gagcactccc aagaccaggc tatgtcagac 9552 0 

atggccacag agtgcctgcc ctgcctaggg ccctggtgca gggtgagtcc taggacagcc 9558 0 

atgcttagta ttatgtgact ccccactccg ccaccaccca ggtcacagag aactgggtta 95640 

aggcagggcc ctggcacagg ggcagccagc accgcagctg accagtggta tggagtgaaa 9570 0 

agatgtgctg ggcccagcat ttgggaactt caagggggtg acagaggtga tttgtgcaga 95760 

ggaagtggca aagggccgaa aactggtgag acagaggctg gacaggcctc cgggggcagc 95 82 0 

atggtacagg gactgcaatc tgagccaggg aaaaacaggg cgaagtcaag ggtgaggcag 95880 

ccagctggtg ggagaagcag gagagtggac aagaggagct gtactgggag gtagagggcc 9594 0 

atgccttgcg gtgctggtgg tggggcaggg atccaccccc tcgcttgact gggaggccac 960 00 

tggaacctcc tgttcaaagc tacttctttc catggcctct ggggctgctc tctgcacctg 96060 

ggggcaaggc tgagggcctg ccccagctcc cacagcccca gcagagctct gaggagggga 9612 0 

accgcaggag taggctcagg aagcaggcgc tcggagccta cccactgcac gcagggtccc 9618 0 

ttctgcagcc ccagctgcat cgctgcagat gggctcctgg gagtcggtag caacaccagg 96240 

ccaggccggc ccctgggagc agaggcagca ggacgctgag gagcatggcc agcaggaagg 963 00 

tgtcatggtc tgcggggatt gggggaaggg gcgctgagtc ctgagcaggt gcaccacccc 963 60 

agctcctgcc cacatgcccc ccactggcat actttcagcc tgcacagggc cactgtccat 96420 

gctgccacca aagcctggct tgtcacagaa gggtggagcc cagcctggag cacagtggca 964 80 

gttatggttg ctattgcaaa cctgcagaga agagaagagg agggtcacgt aggattagga 96540 

accccaaggt cacccccact cctcgggctc tcaccccgtg gctgtggcag gcagtcaggc 966 00 

agcgctgaag ctcctggaag gcattcttcc tgcagcgcct gctctggcac acctacgggc 966 60 

agtgcaccag gcagtgaggg ggacactggc ctgcgggatt caaacggcaa ggaggggtcg 96720 

ggtgggcaga get caeca tt ctaggtccac actgggtgcc tggctctacc aggcccaggc 9678 0 

caagcaggtc cagctgggca ctggggagtg ccaaggctcc ccgacaagtc acttcctggc 9684 0 

catctaggtg aacggtagag tccactggca ccatgtgcgg tgegagcagg ctgggctttc 96900 

caccctggca ctgcagcttc ccacacaggg catccctggg gaggaagtag aggggggtca 969 60 

acagctgeag tacccccctt ccccaaaccc actccatagc ttctgctccc tccactcagc 9702 0 

tccactccct acctccctgc acagggcagg aagtggccct cgctgtcctg gccgcagttt 97080 

ccatgagcat ctcccgcaga gttcaccacc tggaaacagg cctcgggagc tgggtgggag 9714 0 

cctgaggaag catgggccag gctgggggca geteggagag gggctgeget cagcgggcct 97200 

cagtttcccc tgcccatctt ccccacagta gaaaactggc cccaccagag gatggggggc 972 60 

agggtgcaag ggtgctcgtg tcctctcacc aggcccccag agetgetgge actgctgctc 97320 

cagcgtggga catgcgccat cccagcagta gccactgccc ctggcacagg gtgagccgtc 973 80 

cagtaggtaa acgtctgggg gacagtggga ggaggtgccc gtgeaaaact cagggaggtc 97440 

acagtcaccc atggcctggc ggcacagcgc tccagccggc ttcagctgcg caggtgaegg 97500 

gtggtgggga aggcagagag aggecaegtg cagtgagagg tccatgccga gagegegget 97560 

cggagctggg ggagecagge ctacccaagc ccagcaccca aegggggaac ctgagggcac 97620 

caattaacta aggecaacag gccggctccc aagctccccg aaaccctcac cctgaacctt 9768 0 

ccatgccctc accaggcagc gcacgcagca gtccccgtgg gcgcactggg cccccgggcg 97740 

cagegagcag ttgtgagcaa agcagcagag gtcgcggcac tcctgggacc agaaaggcaa 978 00 
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gaagggccca 
ggcgagccga 
cgttcccgca 
cgccgccccc 
gcggaaacgg 
cccccgcgct 
ccagagaggg 
ccccaatagt 
gtgtaggagt 
tttgcctgag 
ttatccccat 
tccccaggac 
agccatgacg 
gctgaggccg 
ctccgagtgg 
tctccctcgc 
cacatgccct 
gcggagcgcg 
tccgggtcag 
gcccccggcg 
tgcggtcccg 
tgagaagctg 
tccttcgtgg 
cgccaccccc 
ttccaggaga 
cagaaatccc 
gggtctctcc 
ctgcgcgctt 
ggcttgagcc 
cttgtttctt 
tctgatttgg 
ctgctctggg 
ccacaggttc 
tccttggagc 
atcaggccac 
aggcccagcc 
ggagtcgggg 
ccagccccaa 
gctggctcca 
tacctctggc 
caccactggc 
gctgagaggg 
tctcccagcc 
gtgtcttagg 
agaggaagga 
gaggacgccg 
caggcagcag 
gacttggagt 
gttgggcatc 
cagacttcta 
cctccagaga 
acacccactc 
ggatggatca 
cttggcctca 
tccaccacct 
aggtaggtga 
acagtggaac 
gtcaagtgct 



ggtgagggcg 
cttaacctgg 
gagcgccggc 
cttgcggaag 
gtgcctaccg 
tctctgatcc 
gaagtaaccc 
gage age c eg 
aacctcgcca 
cttcttccct 
tttacagaag 
tagccgcccc 
cagcctccgg 
aggctgtggc 
tcctgggggg 
ccccgcccgc 
cgacgggcgc 
gcgtgaccag 
aggcacccac 
ccactgcagg 
ctcggtccac 
agggegagge 
ggcgcccttc 
atcaccgctc 
cgctgtttgg 
ggtggcttga 
tcaccagggt 
ctcgcctgcc 
ctgaccccca 
cagacttcaa 
agggctatag 
gaccaccagg 
ctttccaggt 
cccggggtgg 
tagggtgcag 
tcctctccca 
aagcccctta 
atctcagcca 
accgcccctt 
ggtgcatccc 
tgcccatctg 
ggtgttacag 
ttcctcccta 
gaagggacag 
gccttggtga 
gggaggagtg 
gatccacagg 
tggggaaatg 
tggtcgaccc 
gaccccacat 
aaacagaatc 
eggtaatgat 
atca'taagtc 
ttacctccag 
gccccagctc 
tgtgaccacc 
tagagtttgt 
cagaacagcg 



cagcgcccca 
ccagggccgc 
ggcaccggga 
aaggegegea 
geaeggggag 
ccacccctgg 
gcgcaaagtc 
ggacccaagg 
ggttactcgg 
ttaagectea 
aggaaactga 
gccgcagccc 
actcggccgc 
cgatctcatg 
ccgtgggagg 

ggggctcacc 

caggcccact 
geggggcegg 
gtgagcagct 
aaggeccaga 
acctccaggc 

ggggctgaag 

ctcttcccca 
tctccccgcc 
tgtggttcaa 
ggggctgagc 
gtggtctgcc 
ctgcggaggt 
accccagctc 
taaaaatact 
atacagcatt 
aaggctggtc 
gagcagctgc 
ccagggacgc 
aggggtagga 
gagctcacct 
ctcgcccttg 
gggctggagc 
aggaatgeaa 
agageccatg 
ggccgtagtg 
ggaacactga 
aatgctaatg 
gagcaatggt 
tggcttagtt 
ggaatagggg 
ategggaggg 
cggtgacctc 
tcttgccccc 
acctcccacc 
gccacctgcc 
tccatcttca 
aatctgtctt 
accacccgct 
tgccccctac 
ccattgaaag 
tccctgtcca 
cctgcggtca 
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gaectgageg 
agtcacactc 
gtccggggtc 
gctggcggcg 
ggcattgggc 
gcttggctac 
acacaacaag 
tggaatcgcg 
aaaataatct 
taaaccaccc 
gggacgacca 
cgacccccca 
agcctccacg 
ggccatggtg 
gcggtcactg 
gtgctcacgc 
gtggcgccct 
gaggtgaggc 
gcgcggagtc 
gcgtggcgtt 
cggtcagcgc 
cegggacagg 
aaccccacca 
gcccccaacc 
gtttcggtgc 
tggccccatc 
acaatgtaca 
gcaaatgggg 
ccagaaggaa 
gggactcgag 
cccactccca 
atgcccgctt 
tccatccgaa 
agataatagc 
gcgggtgtga 
catcccagag 
gtagtggcaa 
aagaggggca 
ggaggagtag 
gaagcatctc 
ggtttctatg 
attcagcttc 
gagcagcttt 
gaettgetea 
gtgggagatg 
aagagtttgt 
gaggagtcag 
ccccagttcc 
acctgcccta 
tgtccttcag 
cacgctgctt 
ggctccatct 
ctttaaagaa 
aatgatggct 
ccctgcccca 
ggtagggacg 
tccgactcca 
agcactcagt 



gagagggcaa 
ctcgcccgct 
eggggcattg 
getgeaggeg 
atggagggac 
agccgccaga 
egggacaggg 
acccgacggt 
teatacegtt 
tgaageggae 
aagaaacgea 
cccgcgtacc 
cagcagccgt 
gctgcggcgc 
eggcegtaga 
ctcccgagct 
ggaaggcgcg 
cgccccaccc 
gtggggccgc 
ggcgtcctgc 
cacctgaatg 
gcgccccatc 
gcacctgcct 
tggtccacgt 
cgagtcaaga 
cctgaccccg 
gttccaggta 
accctgagtg 
gtttaacatg 
gcctgtgaat 
tccgatcgat 
tgttcccagg 
agatctegtg 
tggcattcct 
gggagctctt 
caggtgeaga 
tgatcctagg 
agagggaggg 
gggtaggaat 
accgtgtggt 
tatcctgggg 
ctcctgcctc 
atgagtgaga 
gatcagaaac 
tgaatatggg 
ggttcccagg 
gagacactgc 
cctgcctgct 
gatccttgaa 
tgattgatgc 
ccaccctccc 
caacaggatc 
aatccttaac 
gcttcccccc 
cacacacccg 
tegggaaaat 
cgagggagaa 
aggtgatata 



gtgggggccg 
tccacgaagc 
gagaggcaag 
ctgaacacgc 
agtcccccaa 
cgcgcagagc 
gaegatgegg 
gctcctcccg 
gagaatccac 
actatgatca 
gcggaggaag 
cggtggccgc 
eggggtegtg 
cgatggggag 
gcctcctgtc 
ctcggcgcgg 
gcccctgggg 
gggacccgcg 
tgcgcccaca 
gtgacgegge 
tccagagtcc 
gcgccggtgg 
gtcctgccgc 
agttggcgac 
actgggaagg 
ccaacccctg 
cttccgggtc 
gaagctgctg 
ttttctggaa 
tcctgtctct 
gcccctgacc 
atccctgtgg 
ggttgagaag 
gctgagggtg 
tccccatccc 
ggactaccca 
gaggaagggg 
tgtggtaggg 
ggtggggggg 
tgggggccag 
ccagcagcct 
ctccaggatg 
cactcacagt 
tcttggggct 
aagcaccagg 
ggacctgcag 
cgaagaatgg 
gccctccttt 
atattttcct 
tcaccccctg 
tgccttctcc 
tttcccacac 
ccaacctcac 
tcccaggcat 
ccaccctagg 
atggttgggc 
taaaatacgt 
tactgataac 



97860 
97920 
97980 
98040 
98100 
98160 
98220 
98280 
98340 
98400 
98460 
98520 
98580 
98640 
98700 
98760 
98820 
98880 
98940 
99000 
99060 
99120 
99180 
99240 
99300 
99360 
99420 
99480 
99540 
99600 
99660 
99720 
99780 
99840 
99900 
99960 
100020 
100080 
100140 
100200 
100260 
100320 
100380 
100440 
100500 
100560 
100620 
100680 
100740 
100800 
100860 
100920 
100980 
101040 
101100 
101160 
101220 
101280 
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ataatctggg 
aggcaccttc 
gatggtccag 
ccaaattcat 
aaactgctgc 
tttttctttt 
gagtggtgca 
tctcccgtct 
ttttttgtat 
ctgacctcag 
attcccttag 
tatcttgcaa 
ctcctggggt 
ttcagcctcc 
ctggggtcct 
actgatcccc 
agcaggagct 
tgggcaagaa 
taggtggcct 
cagcaggacc 
cccaaactca 
tgccaggcac 
ggccccgccc 
tgctgaggac 
tacctgaggc 
cctccacttc 
gcccccgtcc 
cattccagcc 
ccagccctct 
tcccccttgt 
tgggaggctg 
tggcctgggg 
ggggagcaga 
catgaggctg 
actcgaggtc 
tgccccttga 
tacctagcct 
accggtgggg 
agacagggag 
atggggaggg 
ttatgggcct 
cctctcagcc 
gctgaaaatt 
aaatcgttca 
caatttcccc 
tactgccctc 
ctggaagcgt 
gcacatgggg 
ccagaagtgg 
ctgagaggga 
ttctttttgg 
ccacaaactc 
agggagggaa 
cacgagatcc 
gcaggggcct 
gtaatgtagg 
catggcactg 
gtgcggggtg 



tggttttaag 
ctgagcacag 
gacactgagc 
tcccccctgc 
ccccaagtgt 
tctttttttt 
gtggtgtgat 
cagcccccta 
ttttagtaga 
gtgagccgcc 
cctggggtgc 
atgtccacta 
tgggagggcg 
agggagccaa 
ctgcccatcc 
tcctcccagc 
cctggccttc 
agagtgtgga 
gtggagccgg 
ccccacatac 
ggatgtttgc 
accagcatcc 
tcttctttca 
attatggggg 
ttggttccaa 
caattttggc 
ctccggctcc 
aagagagcac 
ctccttcccc 
ccactcagcc 
ctgcgccttc 
ccctgtccaa 
tggtccttgc 
gaatgccact 
cctgttcctc 
tatatacccc 
ctactctttc 
gaagcagagc 
gcccagaggc 
caatctcaaa 
aggggatccc 
ttcatggcaa 
ccaaacaaag 
gaccttagtg 
aagctctctg 
ttagtttgca 
gtgaagagcg 
tggtcttaca 
attccacagg 
cagcagtgtg 
tgcttttttt 
tggacccatt 
agagggaggt 
agatgtccaa 
ataggagtag 
gagagacaaa 
accggctcct 
actggctgcc 



agcctgcgct 
gtctgcctgt 
accaaaatgg 
tcccctccct 
agccactgca 
ctcttttttt 
cttggctcac 
agtagctggg 
gacggggttt 
cgcctcggcc 
cggctcatct 
ggtggtgctc 
gaggtgcccc 
ccacaggcct 
agccacccgc 
cctaccccag 
agcctccagg 
gctgagatgg 
ctggggagga 
tatggagggc 
agtcctgaca 
tccctagaga 
tgcccagaag 
catcgggtag 
atgagctgct 
aaactgctcc 
gacacctgct 
actggctctc 
caccctcttg 
tggctggggg 
ggaatgttta 
gggtctcaga 
ctccctggcc 
tgctttattg 
tgcagcccct 
tcatcttgaa 
ttctgtacca 
ccctttgggg 
agagtgaccc 
gctcaggcca 
agtgagggcc 
tgctcccctg 
agggctctct 
ggtctcaggc 
ggcttccgtt 
gcacagatat 
ggattgggaa 
tctgggggac 
agccagtgag 
tccacagggt 
tttttttttt 
tctgcctggt 

ggggtggaga 

accccagtgc 
tgacttggtg 
agggctggga 
ccaggctgac 
cagggatatg 
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ccagcccgga 
cccttcccca 
ccttctgtga 
gtagatggag 
tctttttctt 
gagacggagt 
cacaacctct 
attacaggcg 
caccatgttg 
tcccaaagcc 
tttccctcta 
actcatcgcc 
acagcggggc 
gcgtgactct 
acagatctgt 
cctggcactc 
gccaccagcc 
tggcctccag 
agttcttggg 
gtggaggagg 
acagccactt 
ccagaggtcc 
gggcatcaaa 
ccatggtcag 
gctcatttcc 
ctcttccaaa 
tcactggacc 
agctaggtgt 
gcgtctccca 
cc cagagttg 
acccccgact 
gtctttggag 
acaggcccca 
gggaaaggtc 
gctatcttta 
atgtgagtgt 
tcttggcagg 
ctctcctctt 
cagtgtgtgt 
gtgccgtgct 
ctgggttggg 

CCttCGCtCt 

aggaggggca 
tcccagccta 
tgcccctctg 
tatggcacaa 
gaggctgggg 
atcaggagag 
gctgaaggtt 
cacatgttct 
tttttagagg 

gggggtgggg 

aagctgaaat 
cttcttctgc 
gttctgggga 
ggttcagggc 
ggtgcgccag 
tcctggagta 



cacccacccc 
gctcaaaatc 
tctggccctc 
accttccagg 
ttttcttttc 
ctcactcttt 
acctccgggg 
ccgccaccac 
gcaaggctgg 
accgcatctt 
ggatttcttt 
agcagggagc 
tgacagcctc 
ccctgtcatc 
gtcagtccct 
actggttctt 
ccatgtctgg 
gcctcctgcc 
gagaacgtgg 
tgagaacata 
ggaagggcgt 
cagaaaggtg 
agcaggggaa 
ggcctcctca 
tatagaattc 
gttttcctgg 
cacgaagtaa 
caggaggctg 
ccctgtggga 
gagccggccc 
ccttttctcc 
ggagttcttc 
caaggcctcc 
tgcaccggga 
ctcttgccct 
ttcctgcctt 
cttcctgggg 
ggtcacagcc 
ccagccttcc 
tgaccagtgg 
agctgctggg 
tgctggattt 
ggggtgtagc 
aagagctgtg 
taaaatgagc 
acagatgggg 
cagagcgtcc 
tgaccactgc 
caggccttcg 
ggtcaacttt 
ctcctgaaaa 
gtggcccaga 
gacttccatg 
ccacttgagg 
ccccagcaaa 
ccctgtggag 
ggttgtccat 
aagacagagc 



acccagccca 
tttggctctg 
ctgggactct 
caggtcagac 
tttctttttt 
gcccaggccg 
ttcaagcgat 
gcctggctaa 
tctcaaactc 
ggtccctgcc 
agactcagca 
taacaagccg 
agcggtcctc 
tgcaccctct 
gccccccaac 
ctccagctca 
cttcgagacc 
tgccagggag 
gctggggagt 
caaagatgtt 
tggcacagcc 
cccctcccct 
gacagagggg 
gagcctctgc 
aaatttgact 
gcctccagca 
acatggacgc 
gcttggacgg 
acaccccact 
aggagcttcc 
aaaaatgcac 
cttcgcaagt 
agcatgagct 
aaaaggccat 
cctggtaccc 
ttggagggga 
gcaggggccc 
caggccagac 
cctcctgggg 
aatgggggcc 
tctctggggg 
ggacagtagg 
caatggttta 
tgaccatgga 
atatcaaggc 
catggttatt 
tgcagaagaa 
cccccccata 
tggcagggcc 
gcaaaaggtt 
gcttcaggac 
tcatccaggg 
tgtgcgggct 
ggcaggggag 
actagaagct 
ggcggggaga 
ccaggaccca 
acagggtgag 



101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101820 
101880 
101940 
102000 
102060 
102120 
102180 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960 
103020 
103080 
103140 
103200 
103260 
103320 
103380 
103440 
103500 
103560 
103620 
103680 
103740 
103800 
103860 
103920 
103980 
104040 
104100 
104160 
104220 
104280 
104340 
104400 
104460 
104520 
10458 0 
104640 
104700 
104760 
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ggggacctga 
ccagggaggc 
ctggagaagg 
aaagagaagg 

ggggggcagg 

gaaggaaaga 
agagatgtac 
gatgttgcct 
agggttttgg 
tccggacgga 
ccagagccaa 
tccactggcc 
gagcctgcag 
cccccctcca 
ctcagctcct 
caaacatcaa 
aggcagatga 
agggagggct 
gcacttaact 
aagaggtagg 
gtccacatgg 
aggggcagag 
gagggagcca 
ccctgacccc 

tggggccaga 

caccgcccag 
tgttcccggc 
ttgttccagc 
gtctttccag 
ccaaacaagg 
gaggggtgag 
ctgggggact 
gtggaggagc 
gaatccaggc 
gggggaggga 
gtccgggccc 
ccccccaaag 
ctccgcgcgc 
cctgcccctc 
ctggccagag 
gcctccagcc 
cgcccaggct 
ccttcccgcc 
cgctcggagc 
cccggaccgc 
tcggatgacc 
ggaggagaga 
ggcaaaggcc 

tggggccgat 
gttgagtcgg 
ggctgccgct 
cctttcttcc 
ctgggagacc 
ctcggtgcaa 
ctccctctgt 
tccctggttc 
gctaccaccc 
aggatggtct 



ggaacacagg 
actggaggcc 
agggacgccc 
ctggaaaggg 
cttgggctcg 
gcgcggcagc 
gaagagagga 
tgcgaggtcc 
taccctggcc 
gccttggtgc 
gcattactca 
ccaaccactc 
cagccctcct 
cgcagccctg 
gctaggccac 
agacccttcc 
gggccgagag 
ctgagtggat 
gagagtagag 
gtgggcccgg 
ccttgcgcgc 
ctggagctgg 
ggctggtatt 
aggatagggc 
acagaccagg 
tctgcatagc 
caggcacttg 
cccatcttaa 
agcactgcta 
caacaatata 
cagggtggag 
cttgggcatg 
aggagcgggt 
agagggggct 
gccatgccct 

ggggccagcc 

agtccccacg 
tgacccgagc 
agggcgcacc 
cagcagcagt 
catagctgtg 
ggattgcggc 
cccacctccc 
ctggccaggg 
cgcccctcga 
ggcccgagcc 
accgctggga 
gcctctcgct 
ctctgtcccc 
gtcatccacc 
gtcttggggg 
ctggcggtgg 
ctttcggcct 
atttggatga 
cgccaggctg 
aagcgattct 
ccagctaact 
caatctcctg 



ggcatgggac 
ttttggggca 

gtgggcatcc 

acagctcagc 
gcgatcacaa 
cttagggatt 
cacttgtgcc 
accttgccca 
cagaaggccc 
tccctggccg 
gagggcgcct 
tcacaggccc 
tcccccggta 
gcctcagtcc 
cccaactcct 
tccaagaagc 
gagtctgcca 
gtgggggagg 
aaccaagctt 
gcacaggtgt 
acgtgctggc 
agctggagct 
gggcacatag 
tcagagggga 
cccctgtacc 
acagcccagc 
ctgtgcagga 
catctttgtt 
aacagacacc 
aacaacatac 
gccacagcta 
ggggcttCCt 
cacacggtgg 
acggctatgg 
ctgctttgcc 
caagcacact 
ccctgaccta 
tctgagcaga 
ccgcccgcgt 
agtagcagca 
agctcctcgg 
cgacccgtgc 
accgcccgcc 
gccccgacgg 

gggggtggag 

gtccctcccc 
aggttcttgg 
gctgaagttg 
tccttgctac 
tgcactgggt 
cttcattgct 
ccagagggca 
cagtaacatc 
gatgttaact 
gagtgcagcg 
cctgcctcag 
tttgtatttt 
atctcgtgat 



aaagcagagg 
gacttcacct 
ttggatctga 
tggggacacg 
ggaagaggcc 
tttagatggg 
ccccccatca 
gagctccctc 
ctcctcatct 
ggtgtgccta 
ggaatgttcc 
gccctgcagg 
cccagtccca 
tgctgggctg 
tttttctcct 
cctccttgat 
ccttggaaag 
ggaaggaggg 
tgctgctcct 
ggtgtgggtg 
cacgggcacc 
ccggtgaccg 
acgcccctct 
gggagcagtg 
tgtttggtcc 
cccgcatgcc 
ctggctaatc 
tggaggggtt 
ttctatctgg 
acactgccct 
gttctgcagc 
agaggatgga 
cctgcggatg 
gctgggggct 
agcggagtgg 
tgagcgtccc 
ctggccgtat 
acccatcccc 
cctcaccttg 
gcagcaacgg 
cctctaggca 
ccggtgcagc 
ctcgctcccg 
cgcgcgccat 
ctgggcggag 
gtcggtctca 
aggaccgcgg 
gccccaggag 
ccgtcctgcc 
gcccccaagg 
gggcaaaggc 
gaattgcata 
tgtttcatgt 
tttttttttt 
gcgcgatctt 
cctcccgagt 
tagcagagac 
ccacccgcct 



gaggggggta 
tcaacacgcg 
ggagctatca 
ggagtcccct 
aaggccgcca 
cagcagatgc 
tctgagaaaa 
ggggactctg 
cttcaagggg 
agggggctct 
cctggaatgc 
agccaggccc 
gctcccagaa 
atggctgcct 
ccaccttctc 
tggatgagtg 
gaggctagag 
gaggtctctc 
aggcctctaa 
cagtgtggtg 
ctgaccccaa 
ggtgaatggg 
cccaggggtc 
gaccgcctgg 
ccacacagtg 
ccctccctgg 
ctcccacccg 
accccgagga 
agaggccctt 
gctgccctgg 
ctgagagcaa 
gccccgctga 
gaagctggtt 

gggggctggg 

cagccgggca 
tgggcaggtc 
ggtgccgggg 
gccaccacca 
aagcaccccg 
ggtcccccga 
gcggctcgca 
ctcaggccgc 
cctcccctcc 

gggggagccg 

gagggaatcc 
gagggcctct 
cgtggtggga 
cgcgatcttc 
ccgagggtgc 
ataggaaggt 
gatgcagcag 
aaagctgcag 
tttaaacttt 
tttttttttt 
ggctcactgc 
agctgggact 
gaggtttcac 
cggcctctca 



gaggacatcc 
tgggctcagc 
aggaggagga 
gacctttgtc 
gtgcagaggg 
ctttagggtg 
acaacagcca 
tcctggtggc 
aggggacgct 
aggaggaatc 
tcccagcccc 
caggcaccca 
cagacagcct 
gtggaagtga 
tcccagacta 
aattgccatc 
gggccagtgc 
agcccagaga 
gggtttgggg 
tgtgggtgct 
tgagggagag 

ggtggaaccc 

ccatcacctc 
ggccctcccc 
ctgtggaagc 
ttgccctccc 
cttgcagagg 
gacagctgca 
ctctatctca 
gaggagggac 
agcagggact 
gtcctaaggg 
gtgagagcga 
ctgtccccga 
gtgtgggcaa 
ccacggagac 
ccgtgagacc 
ccgcgcctag 
gcgcctggca 
gctctccggg 
actccggctc 
cgccttcgga 
ccgccaaccc 
ggtcgccact 
gtgcggcccc 
actcctgaga 
tgaggcggtg 
cgtggtctcc 
cctggcggag 
tcaggcaacc 
acggagacaa 
actcccaggc 
tgttttccta 
gagatggagt 
aacctccgac 
acaggcgcgc 
cattttggcc 
aagcgctggg 



104820 
104880 
104940 
105000 
105060 
105120 
105180 
105240 
105300 
105360 
105420 
105480 
105540 
105600 
105660 
105720 
105780 
105840 
105900 
105960 
106020 
106080 
106140 
106200 
106260 
106320 
106380 
106440 
106500 
106560 
106620 
106680 
106740 
106800 
106860 
106920 
106980 
107040 
107100 
107160 
107220 
107280 
107340 
107400 
107460 
107520 
107580 
107640 
107700 
107760 
107820 
107880 
107940 
108000 
108060 
10S120 
108180 
108240 
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attacaggca 
tttttttttt 
tgatctgggc 
cccgagtagc 
agagatgggg 
gcccgcctcg 
ttttaagtaa 
tcgagcagtg 
gggaaaagtc 
tttcataggc 
ttattaacgt 
gatatcagtg 
tcactctgtc 
tcctgggctc 
ggcaccaggc 
ccaggctggt 
gcactttggg 
ccaacatggt 
gcgcctgtag 
tggaggttgc 
actccgtctc 
tgcctcagct 
gagattatta 
tacacacaat 
ataactacat 
tgcccttcca 
tgtctataga 
actcttctgt 
tttgcatatc 
ccacagtttc 
attggaacta 
atagcaattg 
ctggagaacg 
gagcaccagt 
caccccagag 
ctggcttcag 
gcttactggt 
taatgtcctg 
acacttctcc 
ccccaagctc 
gtcagtgcta 
tcatccttac 
aggaattttc 
ggagatgagg 
ccgggggcgc 
ctccagaggg 
agaagtgctt 
cgcagaacag 
gcacctggtg 
gggccccaaa 
tcccttggca 
gatattgtca 
gtttaaaaac 

gggggctgca 
gggtgtgttc 

atgttacact 
acacaactaa 
tgcagaagca 



tgagccaccg 
tttttgagac 
tcactgcaac 
tggcattaca 
tttctccatg 
gcctcccaaa 
atctatttgt 
agagaaacct 
tccgtgctct 
ttttagatta 
ttttctgcac 
cgtttagagg 
acccacgcta 
aaggtatcct 
ccagctaaaa 
ctcaaactcc 
aggccgaggc 
gaaaccccat 
ttccagctac 
agcgagccga 
aaaaaataaa 
tctcaaatcg 
cttattatta 
aaaatgcact 
aagcatcact 
gccaaacttc 
taagttttgc 
gtctggcttc 
aatagtttgt 
tctactcacc 
tatttgcaca 
gctcacgtga 
aggaaagcca 
atggaggtgg 
cagcccctca 
tgtggatgtc 

attcttacct 
tcattcagag 
ctgcaacacg 
ctgagctcca 
ataactgact 
acagttgaca 
tgccgaagtc 
ccatcaaacc 
ctgacagcac 
gcatcaggaa 
ggcatgggtg 
gatgtctgag 
acgacaggct 
gccaacgctt 
aggtgggtgg 
aacagttttc 
aggcaaatca 
actggtacag 
gcttttggaa 
ttaattgaat 
cccctctgag 
ggtgaagact 



cgcccggccg 
agagtttctc 
ctctgcctcc 
ggcattcgcc 
ttggtcaggc 
gcgctggaat 
ttttgagaat 
acattcacat 
tagccctcct 
ttttttcaca 
tttgattttc 
gtccttgttg 
gaggacagtg 
cccacctcag 
tttttaattt 
tggtcttggc 
gggcagatca 
ctctactaaa 
ttgggaggct 
gatcatgcca 
aataaaaata 
ctggaattac 
ctactttgga 
tattttaagt 
ataatgcaga 
cccactccca 
ctgttctaga 
cttccctctt 
tccttcgtat 
agtgcaccat 
gagagagaga 
ttatggaggc 
gtggtgtgat 
ctcgagctca 
gcgacacctc 
aggtacagaa 
cgaactggcc 
gctgtccctt 
tttcagaagg 
tgtggtctgg 
gtagagggga 
cctcccaaac 
tctgagaaag 
aaagctgagg 
ctgggaaaga 
atgtcctggg 
acacccacca 
ggaggcactc 
gatgcttggg 
catccctgac 
aatttcaggt 
caaagtcatt 
aatctatggt 
agtgtaaggg 
aagtccttga 
gtacaaaaag 
agcctccaat 
cctaagccgg 
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gagatgttaa 
tcttgttacc 
cagattcaag 
accacgcctg 
tggtctcgaa 
tacaggcgtg 
ttggaatgta 
atctcaaaat 
caccaacagg 
ctcaagacaa 
tttttttttt 
ttcttatgat 
gcctgatcat 
cctcctgagt 
ttctgtagac 
caggcgcagt 
ctggaggtca 
aatacaaaaa 
gaggcaggaa 
ttgcactcca 
aaaagaactc 
aggagtgagt 
ttttcaaatc 
ggccagtaag 
cacattccct 
accccagaca 
atttcatata 
tccgatgttt 
tgctgaatag 
agggttattt 
gaggaagaaa 
caaaaagttc 
tcagtttgag 
gaacaagttg 
ttcagtaaag 
ggcagctcga 
aactcctact 
acccaaagcc 
ccctgtagtc 
gaatatgtgt 
cacagtcgaa 
ccagatgagc 
ggtttattta 
aagagggatc 
tggcttctcc 
aaccaggcaa 
tctgtaaaca 
cgactccagc 
tgccccagaa 
agcctggtgc 
tcatagggca 
gtacacactc 
gttcgaagtc 
gggactgtag 
gttgcattta 
tctcaggagg 
ccaagatgga 
atattcccaa 



cttttaagca 
cagactggag 
tgattcttct 
gctaattttg 
ctcccgacct 
agacaccgca 
gtaatttggt 
caaaaagtac 
aaaccaatat 
tacagacata 
tggtcgctta 
tattatttag 
gcctcattgc 
agctggaact 
agggggtctc 
gtctcatgcc 
ggagttcaag 
ttagccgggc 
aatcgcttga 
gcctgggcaa 
ctgatcttaa 
caccacagct 
aacttcatta 
atgagtttcg 
cactcacaga 
gccactgatc 
aatggaatca 
ttgagattca 
tgttcggtgg 
tccagttagg 
gagggagaga 
ccgaatctgc 
ttcaaaggcc 
gggacaggaa 
caaggctgaa 
ggagctactc 
taaactgcag 
aggttagcat 
gtccacttcc 
gttgctcact 
aagccacatg 
tgtgtccaag 
cattttgagg 
ctaggatgca 
actggcttgt 
accagtgagc 
cgacttctcc 
cttcagagat 
aaggtcatgt 
atttagaggg 
agggtatttt 
tgtgattcta 
aagacagtag 
gatggtctat 
taatgtgtga 
cctcagacca 
catatgtccc 
atccccccag 



aatctttttt 
tgcaatggca 
gcctcagcct 
tatttttagt 
caggtgatct 
cccagcctac 
tagtgaaagt 
agaaagcata 
gattagtttc 
tttttttctc 
atacacctta 
agacagggtc 
agccttgaaa 
acaggcacac 
actttgtttc 
tgtaatccca 
accagtctgg 
atggtggtga 
actcagaagg 
caagagcgaa 
gtgatcctcc 
gtccagctac 
aggtataatt 
ataagtgtat 
aagagccctg 
tgttgttctc 
tgcggcatgc 
tttacactat 
tttgagggaa 
ggctcttata 
gatatttatt 
catctgcaag 
tgagaaccag 
agcagagcag 
cacagagggg 
tggcgttctt 
gccatggctt 
cccctgactg 
ctgtctctct 
tcctagcaca 
gggatcagag 
tgcaggtcag 
ttgcagggga 
ccgagcagct 
tggcgtcacc 
attaaccctt 
caaggagtga 
cgccagggtg 
gtgtgaatgg 
gaactttttg 
agctttaata 
cttatgtaaa 
ttacccttgt 
ttcttgatct 
acttttctgt 
ctggaagcgg 
cttggaagta 
tagccgcagc 



108300 
108360 
108420 
108480 
108540 
108600 
108660 
108720 
108780 
108840 
108900 
108960 
109020 
109080 
109140 
109200 
109260 
109320 
109380 
109440 
109500 
109560 
109620 
109680 
109740 
109800 
109860 
109920 
109980 
110040 
110100 
110160 
110220 
110280 
110340 
110400 
110460 
110520 
110580 
110640 
110700 
110760 
110820 
110880 
110940 
111000 
111060 
111120 
111180 
111240 
111300 
111360 
111420 
111480 
111540 
111600 
111660 
111720 
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tt cage age t 
tggctgeatt 
cctccccact 
gtgecatgea 
ggagcagaaa 
cttgcagccc 
actcctctgt 
ataggatgga 
ggagcatgaa 
cacccggccc 
tctgtttttc 
aggctgeagg 
aggggcacgt 
aggaaaagga 
atgctgacaa 
tccctggggg 
tctggcacaa 
aacaccaaac 
gcccacaacc 
aacatggect 
tgggagtcct 
tcccagtctc 

agtggaaaag 

cctccactct 
tcacaatcaa 
caacatcata 
atgttgaagt 
acagaatgee 
gtgactttcg 
aagggaggag 
gggtcaacac 
gggccaggtc 
aacactgcct 
ceggagggea 
tgtggctgca 
tgggagccag 
ggagtgtgca 
gaagggacac 
gcttgggagt 
ggaaatggag 
cagctgacac 
gctcctggtt 
atgacagcag 
gtgtgtgttg 
cgcccatgct 
cctctcccct 
accacaagag 
ettgetttag 
tagggacaat 
gtgaacttaa 
tgccttaccc 
cccccaggcc 
gctggaactg 
gctgcaggcc 
ttgetttate 
aggcaaatca 
ggcttttgtg 
taggtggagg 



gcttatggtc 
tgacttgetc 
gtcagcaatg 
cggggctggt 
gggacacaaa 
ctggccccct 
tctagcccag 
gttttattgg 
aatgaatgag 
ccaactgcct 
aacccaaatc 
gggagtcact 
cattccttct 
caggagctca 
catctgtgtc 
tagggggggt 
ggatgtctgc 
cttttttatt 
agaatggeca 
gctggtggca 
ttggaagtcc 
ccttgtagct 
tcctgtgcag 
tccttcttct 
aggcatcatt 
aaccategta 
cactgtattt 
ttggtgagcc 
aggagaagga 
atccagagaa 
cctcctgggc 
ataaaaagtc 
cattcacatt 
ggcaacacca 
tcaggatcaa 
ggcaggcccc 
tggctggaag 
accacagggg 
ggggaagagg 
gggatgctgc 

agggattccc 

ccctgcacac 
tctccagatg 
aatggatacc 
ctgcttacaa 
ccccacctcc 
cgtgggaacc 
cacagccagg 
tctagaaggg 
taggaggtcc 
caatcttctg 
caacagcagg 
atgeaggegg 
cccagtgctt 
gctgaagctc 
tgetgeagea 
cagagtggcc 
tagaggcaga 



ctccctacac 
tctccctgtc 
tagacaagac 
eattgtgggt 
tcacttcagc 
cctacagatg 
ctgtttgggt 
gaggacattc 
cctggaatga 
cccagcgtgt 
ctagagcaga 
agcccactca 
tggtaactca 
cacagtcatg 
tgccctccac 
ggtggtggga 
ttaggcaact 
tcctcagtct 
ggggcacacc 
ttcccaacaa 
aattccaaat 
tgeagtagee 
cttctgagtt 
ggctggaggg 
ttagggatag 
acagccagac 
ggtctcttgt 
tctgaggatg 
tgtgaaaggg 
gagagagega 
agacctctca 
agatgtcaca 
cataggctgg 
ctggtcagcc 
tgagcttccc 
caggccccca 
tgtgacttgg 
aggagaaact 
cctcagaaca 
caagtccaga 
tctgccccga 
tctccccaac 
ccagttggca 
tgcgtggtct 
acacgcctcc 
tgcccccatt 
tcatggagag 
gcagagagga 
atttccaatt 
taaaggacct 
gcccactccc 
agcaggaggc 
tgcagggctg 
ccttccaagg 
tccacacaag 
ggagggatac 
tgggagatgg 
gcccaggaca 
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cctctcttcc 
ccacctctgg 
aaaegcttag 
ggcaaatgtg 
gtaaggtaat 
gtgtctaaga 
tgcaagaaag 
tggaeggget 
ggg a gggtgc 
tagctcctgt 
gggctcttcg 
agagctctct 
ttctctttaa 
aacagagatg 
tcttgtaagt 
atgagggcat 
ccactgcctg 
tagtgtgaga 
ctggccctca 
tgtcaaaagt 
gtcctctgga 
atgtgacaat 
gcgtccacca 
catgtggaca 
agaaaaggaa 
tagcgtggga 
tagagcaggc 
caggatgetg 
cagggcttcc 
gatcagctca 
ccacccattt 
gagctgtttt 
agtcatcaca 
caggggtggg 
acagagaaaa 
gectcaggtg 
agccctggcc 
gactttcact 
aaggtggggg 
aagtctcccc 
gcagaacagg 
ctccccttgg 
ctgtgggtta 
ctttctgaaa 
ttctgeaagg 
cctctcccgc 
tgggtgcttc 
ggaaagaagg 
tggaggggca 

gggggaattt 

agctccagac 
ccaccagcag 
gaacacagag 
ggacccaccc 
ggtgccccga 
agggaaggaa 

agccccctcc 
tttgaggcag 



ccagacagcc 
atttagtcca 
ttcacgtgcc 
agcaacacac 
ttgtgataaa 
ataaacccca 
agactcactc 
cccagcaaga 
ggcctcggct 
ggcaactccc 
ctcctcacac 
cctgttggtc 
cagcccgaca 
catctggagc 
gecagtaagg 
ctcttctttc 
gcatgctctc 
gtgatcacct 
gtagatgggc 
ctcagggatg 
ctcaaaataa 
actccagcca 
tgggcatggg 
aageggggee 
gatagaagga 
gagaaaaata 
tgactgatac 
caatccaacc 
tttgagggag 
gtgetcttea 
ttgggggggc 
cgtagaggcg 
gagtctggee 
gcacaggttg 
ccctggtcac 
gggtcccaag 
taccaggtga 
tctgcccagg 
agtcagaaca 
tagtcctcta 
gctctcctat 
tcccaggcac 
gctgggtatt 
agccatctcc 
cccggagtgg 
tccccacagg 
ctctgtcttc 
aaactacagc 
gtctgggaag 
ggggagaaga 
ccacctccag 
tcccaggacc 
egggactcag 
aagatgacac 
gcaatcagtt 
gectttggge 
cctcacctct 
aacagatgta 



cccaaacatc 
tgttctccac 
cacctactgc 
gaagcctcaa 
tgtcatgtaa 
ctaacatgtg 
cagttgcgtc 
gtctggcaat 
acacaaagtt 
cacttctctc 
catctccacc 
cctgattcgc 
cagggtctcc 
aggaataagg 
tagtacccag 

aggggecaaa 

tccctaggaa 
cttccaggaa 
actgagacaa 
agctcacagt 
aaactacatt 
atgggaagtg 
tgtgcttcta 
atggtgagcc 
tcctgagccc 
aaattctatc 
tctgttgaat 
aaagtccagg 
aaatggagag 
gtgtcctcca 
aagaggcttg 
ggcaggactc 
actttctcct 
aggtctcaca 
aggaggcetc 
aggattggtg 
ateagctgea 
tgacgggcag 
tacgtgtctt 
agccagccgc 
ctcctgaggg 
catcctctaa 
atatctgccc 
accgaattct 
actcagagag 
ctggctccag 
ctccctagcc 
aaggaggatt 
taggcagect 
cgaggtgggc 
gtgtagcagg 
cagagcagct 
agcagccaca 
cttctaactt 
tagagtcgac 
ctaagagctg 
gaccccaaag 
ttccccttgg 



111780 
111840 
111900 
111960 
112020 
112080 
112140 
112200 
112260 
112320 
112380 
112440 
112500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
113700 
113760 
113820 
il3880 
113940 
114000 
114060 
114120 
114180 
114240 
114300 
114360 
114420 
114480 
114540 
114600 
114660 
114720 
114780 
114840 
114900 
114960 
115020 
115080 
115140 
115200 



WO 01/78894 



PCT/US01/12245 



40/154 



tcgctgggcc 
atgtgtggct 
tggagagtca 
ccaccctcag 
aagggagtgt 
agtgactttg 
tgactcagtt 
aagaggcctt 
agcctgctgg 
tgggcaactg 
cacagaattg 
agcaatagct 
gatgagggct 
cttcttttct 
ggctggagtg 
agtctcctgc 
taattcttgt 
tcctgacctc 
gcccctgcac 
cccattttac 
aatatccgtg 
gtgagatgtg 
tgactggagc 
cttgagcacg 
ggtcattcca 
ggcagctgag 
gtgcacctgc 
gcctctggg.t 
ggtatagggc 
cagattaggc 
ccagcccctc 
ccactcacct 
tgtgcaaaca 
tcgggccacc 
cagcaccttg 
ctgtatcaca 

aggggcatct 

tcattgccta 
agaagtgaca 
tgtctctttc 
agtcctttgc 
gccctggcca 
ggtgccctgg 
caggaatgag 
atagagtgtg 
aggcacggca 
agccgggccc 
gccttaccct 
gagcagctgt 
ttggggacag 
tgggcgggcg 
tccaggaggt 
cggagtcaca 
ggccctggct 
ccagacagga 
gctactgggt 
tgtgggcttc 
tccaaggtga 



tcagcgcctc 
ctgcaggagc 
ggctggcgag 
gctccacgaa 
ggtgattgca 
ctgctcctcc 
tgaccaattg 
gtagcttctg 
gcacatgtgg 
cagccacatg 
agcagataac 
gactgataca 
tatcgtgtgc 
tttctttttt 
cgttgtggtg 
ctcagcctcc 
atttttagta 
aagtgatccg 
cagccattta 
aaggtcaccc 
aagccttcac 
gggagactaa 
agaggcagca 
agccacgtgg 
gaaccatgca 
gttcagggca 
gggcggagga 
gggacctctg 
tttccaaggt 
ttctcaaggc 
tcgtggactt 
tctacctgca 
taggtgtcat 
tggacatggc 
ccatcatgag 
gccatggatc 
gtgggcaggc 
gctgcctggg 
ccttcccagg 
tctagcacct 
ccactcccac 
aggaccctct 
gcatcctggg 
agtgaggcag 
ggtgcgtggg 
gcctccggag 
agccggacac 
ccagccgcag 
agaaaccctc 
aggctgctgt 
ggcggctgtc 
aggtcaggcg 
ggcggcagcc 
ccccacccct 
ctcaccccag 
accgagttcc 
tgtcattcct 
taggtctgga 



gatgtccacc 
accctggggc 
cggctcgctg 
gaccatcatg 
gaaaatgacc 
cattaagagg 
aagaaagaag 
ctctacttgt 
cccaactggc 
aggtgtgaac 
atgacttttg 
gtgtaggcct 
cagacacact 
tttttttttt 
atcttggctc 
tgaatagctg 
gagacggagt 
cccacctcag 
tttcttatac 
aaccagacag 
tcttggcaga 
ggcggaggag 
gccccagtgg 
gtgaaggcga 
aaggtggagt 
gcgccctcag 
tagagagatg 
aggggcctct 
ggaaccatgc 
agagctttct 
ggacctgaga 
accgcccgat 
cacctgcagg 
ctgtccctga 
atagggccag 
tggccctgga 
agggcacaga 
gttggtcagg 
agtcctgtgg 
atgtcctctt 
cagagcccag 
gctcacagag 
cctggcaaga 
ctggaccctc 
cagcagggtc 
ccaggatcac 
cacagtgggc 
ctccagggac 
atccctgggc 
cgaggaggcc 
cacagtgcac 
caggttgcgg 
tgcaggaggc 
atggctccca 
cgccccctac 
ccactggaca 
gtgagtccca 
gaggcggggt 



ctcagggcat 
tgactggagg 
tccactcggc 
gtgggcgtct 
acaaatttct 
cagaacctat 
tgatgttgtg 
tggaacacaa 
agccaggagc 
aacagaagaa 
tttgaagcag 
tggtgaaggc 
cagcacgctt 
ttttgagaca 
actgcaacct 
ggattacagg 
ttcgccatat 
cctctcaaag 
aacttgcagg 
cgggagagcc 
cagaagagga 
ggggttcaca 
ggagctcagc 
gagtgggcac 
tgcccacagg 
gcacgtccag 
attggggatc 
gcacattgtg 
cccctccagt 
cagaccagac 
gcatcagagc 
ggtgctgatt 
cacatcttgc 
tgccaagctg 
ctcagcaggt 
gtcccggaag 

tggggacctt 

ctgaggcctc 
gcagagtgct 
tccccaactg 
aagcaccctc 
gacttgcagg 
gtaggcgcct 
ctgcagccaa 
cgcacaggtc 
ccgcacacct 
ttccatccca 
gtgttggcct 
tgtggccctc 
aagaggcgac 
agtaccaggg 
ggcgcgtctg 
cagtttgcag 
accacccagt 
tcttaatgct 
cctgtcgggt 
ccctgcatcc 
gcccgaccag 



tgggggaagc 
ccaccagtcg 
aatccaggat 
tgggagggtc 
ccctccctgt 
ttcctcactc 
caacttcaga 
caatgtggga 
gtgcagtcat 
ctccccagct 
taagttttgt 
tggagtggga 
tatttctttt 
gagtctcagt 
ccacctccca 
tgtgtgcctt 
tgaccaggct 

tgctgggatt 

atgaggcgaa 
aggattccaa 
ggaagaactg 
ctcacagagc 
caggcagtgg 
aggctccgcg 
cccagggcca 
gccaggctct 
tgtaggcctt 
ggaaggtctc 
gggagctgtg 
agcccgctct 
ccctcctcct 
gage c cage a 
acctgcagcc 
tggaccccgc 
ggctcactgt 
gaggacagga 
gcttaggcac 
tggaccaatg 
ccaggcaggg 
cccattcctg 
caccaccctc 
gcagcaggac 
gcatgagccc 
cgaccgttgt 
actgtgatgg 
gtgccaagga 
gtgggcttgg 
ggeccagagg 
gcagctccag 
cggcgtggct 
ccagctgccc 
cagggcatga 
gtggcattca 
ctgaggcact 
ccttgcccag 
tccggccatg 
agccagactc 
gggggcccac 



caggacatgg 
actgecaagg 
gccccggagg 
tgtggggagg 
acccatgcac 
cttgaatctg 
gccagacctc 
agcccaggcc 
ctgagaccac 
gagcccagcc 

tgtggtcaat 

gaccaagact 
ggttttttct 
ctgtcaccca 
ggctcaagtg 
catgccgtgc 
agtcttgaac 
acaggcataa 
tgatgtattc 
cccaggactc 
aggtgggcgt 
acaeggagea 
tacatcccag 
tgcagccgcc 
cccaggaggc 
gccaccacgc 

gggggctgga 

agtctgactt 
ccccctccac 
agtgggagct 
cccctctctt 
agttttgggc 
gtagggcgtt 
tgctcgtggc 
ccacagtgca 
cagcgtcctg 
cctgtactgc 
gccacttcct 
cttataggac 
ggcccactgc 
ctcctgggca 
gggagctgcg 
gcgtggccac 
ggtaccaagt 
gggcaccttc 
ggecaggate 
cctaggccct 
getgegggea 
gcgcagggtg 
gagggecage 
gccatggctc 
gaggcttaca 
aactcaggag 
gctcctctgc 
tgctccacta 
ccgtgatccc 
acagaggacg 
accgcagcgg 
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taggaggtgg 
tgctgcccat 
caggtcaggt 
acagctgggg 
gccacctgtc 
agccttgggt 
gctcctgagt 
gaccctggct 
ctgaaattcc 
aggttctcct 
actattcccg 
acacacacac 
agtcagctga 
catcctccag 
gcagcctggg 
gatccccgtg 
ggcccagtcg 
ctacacaggg 
catgtgacac 
aagtccttac 
ggggcctggg 
agcgtggccg 
ccctctggga 
tcaggtgatg 
gctctccacc 
cccaacccca 
ctatgccttg 
tcccaccacc 
gccttatggg 
gtgttagctc 
atcatcccag 
ggaagggaat 
agactccagg 
cccctgggtg 
aatgctaagc 
ggaaaagctc 
gaaaagatac 
cgtggtggaa 
tttcctggtt 
atccaggcta 
cgattcgcct 
gctaattttt 
gtgatctgcc 
ggcctgaggg 
acacacacac 
ataagaatac 
taatataaat 
aacaagttat 
cctagtggct 
tctgctggct 
attcacagca 
tatcttcaga 
caagaacatt 
cctctaactt 
ggggacccag 
ggcggctgtc 
ctgacagctt 
ctagctcacc 



catccctgac 
cctggtacca 
tcaccagctg 
agagggaggg 
tccatgtggg 
ggcccaccta 
ggtcccaagt 
gggggcaaat 
ctgtttagtt 
aagaaactca 
tgcccccagg 
acacacacac 
ggccgaggcc 
catagccccg 
agagctgcct 
gagcagccgc 
tccaggacct 
tggggagtgg 
agcccaggac 
aggacacgtg 
cttggcaatg 
aggtcgactc 
ctcctgtggg 
gtgacacctg 
acacctctca 
tgtgctggag 
gcccctcctg 
tggtgggatg 
cactggactt 
ctgacaaatg 
ggccattgtc 
gttagcagct 
ccacaagagc 
tgaagcagga 
aagaacacag 
ctaaaactca 
acagaaccaa 
gaaacactca 
tatttgttta 
gagtgaagta 
gccttagcct 
gtatttttag 
cgcctcagcc 
aggattttta 
acacacacac 
acataatgta 
atacatatgt 
ttcagaaagt 
gctgcagtta 
ggctggaaac 
cccagcccaa 
agactgacac 
tgtggcctca 
cagggagttg 
gctggtggcc 
cacagtgcag 
tggacggtcc 
acattacaac 



tgtgacgttg 
tgtgtaggtg 
cccctcccgc 
cacaggacac 
tgagctactc 
taaatgtggc 
agagttccac 

ggggagacca 

ttatccactg 
agtctggagt 
cttctagaac 
acacacacac 
tggcccaggg 
tggatctcca 
tcgggtcccc 
agctgggccg 
gtgtccatca 
tcaggaccca 
tgggggactg 
gaggctgatg 
ataggcccca 
ctggaggggc 
tacctggcag 
gacccaggct 
cagtctggga 
ccgcagcccc 
ctccccactc 
gttttactgt 
gtgtgacctg 
ttggttcaac 
ctaactagga 
atgttatctt 
acaggacctg 
gaaccccagg 
accatgcctg 
aaccttcaat 
ggctcaaagt 
tgttcctttg 
gggagaattt 
caattttggc 
cccgagtaac 
tagagtgttg 
tcccaaagtg 
tagagactat 
acgtatgtgt 
tatttgtatg 
atgagaatgt 
gctcccacca 
ttacaggtga 
tgcccttacc 
agtctgacca 
tcacctcgga 
cagcggtagc 
gcctcagcct 
aggaggtgct 
atgaacagag 
gggggatctg 
tgccacacta 
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ggcaggggga 
agctgggccg 
acggtagccc 
cagtgagggt 
ctactgctgg 
ttagaactca 
agagtcctgg 
cagacctccc 
ggcatacttg 
ccaccatcgt 
cctgtcccac 
acacacacac 
tgttggaggc 
gacgcagtgt 
ccacaccttg 
gagggtcact 
gggtagtgag 
gccagggggg 
cattccttcc 
ggtgcagcta 
gcttgtgtca 
tggccatccc 
ctcaggacca 
cctgcagggg 
ccatgtgcgt 
ttccagacca 
tcctgcacag 
gtcagtcata 
taaccccagt 
ccataaagta 
tctggactca 
gagcgctctc 
gggaccaggc 
ttgcagggag 
agaccacagc 
ttcttcctgc 
aaatgtccca 
accttcgcat 
ttttttttta 
tcactgcaac 
tgggattaca 
gccaggctgg 
ctgggattac 
atatatatac 
atataaataa 
tgtatatact 
ttactatgta 
agcaagccac 
aaatatctag 
aggactgaaa 
gagtagtccc 
cctggaagaa 
tgccagagtc 
tagcctggaa 
ccccatggaa 
ccatgtggcc 
caggaacaga 
tgtcccccac 



tggagtgggc 
ggtgagtggt 
cgggccacac 
cttgaggctg 
ggagcccctt 
agcttgggaa 
aaagccctga 
ctcccctgct 
taaaatttgg 
taggaagggt 
ctcctctcct 
acactgacct 
ctcacagata 
gttgggggcc 
tagggtggag 
gtccacacgg 
tgtgacgtgg 
tccctcatcc 
cccacacctg 
ggctcgtggt 
aagttatggc 
gataccaacg 
cagcctggcc 
aaaaccaaga 
gtccacctag 
ccgccagggc 
gctccatcct 
tctcgcctac 
tccaggcact 
ctgaaaaggg 
gcatgggtga 
gccaggccag 
aaagggcagc 
tgagatggag 
tagcagggtc 
gaaataggta 
aaattggtgc 
tatgggcatt 
agacagagtc 
ctccatctcc 
aatgcccacc 
tcttgaactc 
aggcatgagc 
acacacacac 
agaatatgaa 
catacataaa 
acaaaattat 
tttgtgaatt 
ctggaggcaa 
aagacatatt 
cagggcatca 
gagtgaggtg 
cccaaggccc 
ccgaccatgg 
caaggccagc 
ctggcccatg 
gggagctgag 
ctcctgccac 



atccaggcgc 
ccacacaagg 
ctgcacattc 
tgtacagcag 
ggcctttggg 
tagcccactg 
gcccctccct 
gcactttctc 
tgtttataaa 
gggaatgggg 
cttttaaagg 
tgagcgtcga 
tagacaccct 
acagccacat 
gccacaaggc 
cacaggagga 
cgtggggcat 
agggctctgt 
gatgggacaa 
ggctgagcct 
tgcaaagcgg 
atatgaggtc 
ctcttggagc 
gcaggtgagg 
gactacacaa 
ccactcagcc 
ggcatttacc 
caaagcgtaa 
gtgcctgcat 
agggatttag 
ccgagggctt 
ccctgcatcc 
tcctcagtgg 
ggacagctgg 
acgcaactgg 
tgctaagtaa 
tgtcggtcac 
atctttcctt 
tcactctgtc 
caggttcaaa 
accacaccta 
ctgacctcaa 
catggtgccc 
acacacacac 
tatattatgc 
catatatgca 
atgatcgcaa 
gccattgcct 
agggaggcct 
tctgcacaaa 
gagtctacca 
ttggatgatc 
agttctcgga 
gatgggacct 
aaggccaggg 
tctaggaggg 
gccacccagc 
acacacagcc 
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taagccatgc 
ctccaccctc 
acgggagtgg 
tcagttctgg 
ccatttcgga 
gtcaagttgg 
gctggagagc 
gccgcccagc 
aaaagcaagc 
ctgggcagag 
gggagtcctt 
gaaggagcag 
cacagagtgg 
cctgccccca 
tggaggcgtt 
aagggttgtg 
aacagccccc 
ccttgtggag 
ctcgccgtcc 
gtcgaggggg 
cccacaagcc 
ccaagcagct 
caagctaggc 
aagaaggagc 
cataagtccc 
tctgctccca 
agcttatctg 
ttcctgaagg 
gatctcactc 
gtggctccgg 
gagcctcccc 
ttggtgtcta 
tgtcccaggg 
ctgggacaga 
ggccccagcc 
tgtgacccag 
gaggaggtgg 
agagcacagt 
ggcagtggta 
gaccctcgtg 
ttaggccgct 
tgatgaggag 
ccccacatgc 
ggtagctctg 
attggtagta 
caacacctgg 
tgaggcagaa 
ctccatctca 
agtgcagtgg 
ctgcctcagc 
tacatgtagt 
ctggtgatcc 
gccagctttt 
ggctggagtg 
attatcctgc 
taatttttgt 
tcctgacctc 
gccaccatgc 



tctttcctcc 
caagtcccca 
agagctgggc 
ccacgggcag 
accaggagaa 
cttctgtgcc 
aggcggcaca 
ctggaaggag 
tagctcacag 
agggttgcag 
cc tcagagac 
ctccgctctt 
ttttgcctga 
ggccacccat 
gcccagggca 
aatctccaca 
acaggtgctg 
cagctgcagc 
agccccggcc 
gtctgcaggg 
tggctaggct 
gtgcagactg 
tcatgctccc 
accattttct 
atctgctgct 
ccctgcaccc 
tgatctccaa 
cttctggaga 
agacaccaga 
cacacatgtg 
gcttggcgga 
tgggagccca 
ctcacccatg 
ggagcccaca 
tcatttccta 
ccctcccgat 
ggtggctggt 
gaga a c age t 
tgagccggcg 
aagcagggct 
tctgcagctc 
aegggegget 
tgecctatat 
ggctttgtgg 
gatggctgea 
cttggctggg 
ggatcgtttg 
aacaattttt 
tgtgatcctg 
ctcccaagta 
agagataggg 
acccgcctca 
tttttttttt 
cagtggcgca 
ctcagcctcc 
atttttagtg 
aggtgatctg 
ctggccacaa 



ccaaacccca 
ggctgcccat 
accagcctca 
cagtgtcacg 
gttagcaggg 
ctcctgaagt 
gcttactgac 
caggaaggag 
ggcagectag 
tacagggaag 
cagctgtgtt 
cctgaatgtg 
ttagatcctc 
tccctgcctg 
ttgctggcct 
cgcagcaagt 
cagccacccc 
ctggcggggg 
ccagcggcat 
aggaagaaca 
cccagaatac 
tgcactgcac 
caagctaagc 
agctcagaca 
gagcagggtg 
aatgecttae 
gccagctcct 
cagttggcca 
aaactttgea 
tcaggggaag 
tgcagagggg 
gcacagcctg 
aggacaaagc 
gggctggcat 
aacttcagcc 
atagaccttg 
gggtcccggc 
ggegaagagg 
tcagtgctgg 
ccattcaggt 
agtgtcactg 
geggggagag 
agaaagcett 
tgcatcagga 
gcatggttga 
catggtagct 
aggecaggag 
ttttttttga 
gctcactgca 
gctgggatta 
tttcaccatg 
gcctcccaaa 
tttttttttt 
atctgagctc 
ctagtagctg 
gagatggggt 
cctgcctcgg 
tttttttttt 
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gcccggcccc 
gccagtcacc 
gtcaggatgc 
gtctccaggg 
ctgccagcag 
gtgtgtgatg 
cacccccggc 
gccccctggg 
gagaagtggg 
gaggacaagg 
gcctatcccc 
gaggagggca 
cteggagcag 
actcaccctg 
cacagaggta 
tgggggcttt 
ctgatggcag 
ggtcgctgtc 
caaggtccag 
tgggcactca 
actggacaaa 
aagggcagee 
tgtgccctga 
aaggtgctgt 
cccacctagc 
cacggagcct 
gtgatactag 
aaagagagca 
gaggcagctg 
cctctggaat 
ctgtctgtcc 

tggggcatgt 

atgg'atgtgg 
tggagggtaa 
ccacgtggaa 
gggcagggca 
aggaggctgc 
ggccactggc 
aggecgeggg 
accaggagaa 
cctgtccttc 
gaagaggctg 
ccagggttct 
gggtttgccc 
agattcagag 
catgcctgta 
ttccagacca 
gactgagtct 
acctctgcct 
caggtgtgcc 
ttagecagge 
gtgctgggat 
ttttgagacg 
gctacaacct 
ggaccacagg 
ttcaccacgt 
cctcccaaag 
ttaattagct 



tgcttctccc 
cacagaggac 
ggcaggcgta 
gaccctgggc 
cttcccggct 
gtgcaatggc 
ccccaggagc 
gtgcccacag 
ccctggggac 
cagcccaact 
ggectgaaca 
gegaagggee 
aacagtccag 
gecattgaag 
caagccctcc 
ggtgaccttc 
ggaagtggcc 
cacacggcac 
cctggtggtg 
tcccacggat 
ggcagatccc 
agaccagggt 
tgcctactga 
atgggctggc 
aatccctggg 
cccctcaaga 
egaatgeata 
ateagecaag 
ccctctgaca 
caggcccttg 
ctgtgctgag 
gcacaaatag 
tgcccatgct 
agacatgggg 
agacccactg 
ggacagggct 
aacaggtggt 
actgtggccg 
gagcaggagg 
gegggcatea 
caeca cc teg 
ggaagggtcc 
cctttcccca 
tttaatgeca 
ccagagccca 
atcttagcac 
gectaggcaa 
cactctgtct 
cccaggttca 
accacaccca 
agate tcgaa 
tacaggtgtg 
gagtcttget 
ctgcctccca 
tgtgcgccac 
tggecagget 
tgctggaatt 
gggtgtggtg 



cagaccaccc 
actcaggagc 
aagggcagca 
ccacagcacc 
cacgttgcaa 
caggacagtg 
caggggtcca 
ggttggggag 
tgtgggggcc 
gtcccagctg 
gagatcatgg 
cccactgctg 
agctccagcc 
gtggctgagg 
tcttccagca 
atgcgtgggg 
acaacaeggt 
aaaaggaggc 
aatgttggtt 
gctccagggc 
gaaggctgee 
gggtggaact 
gtcttcccag 
tgccaccttg 
ggtcatgctg 
tctgccctgc 
gaatggcctc 
gcccccatgt 
gttcegggae 
gtcttacacg 
gggctttgee 
aggagtgtgg 
gagcttgagg 
aggtagagaa 
tggccaggtc 

ggggggecta 

ggctgctcac 
tcccgggccc 
ctgctgccgg 
ggtgtggggc 
gctgccgggc 
ctcctctcaa 
cactactgta 
aatcagatct 
ggcttatgtc 
tttgggagac 
catagtgaga 
cccaggctgg 
agegattetc 
gctaatgtta 
ctcccgacct 
agccactgtg 
ccgtcagcca 
ggttcaagca 
cacacccggc 
gatctcaagc 
acaggcatga 
acatgggccg 



122220 
122280 
122340 
122400 
122460 
122520 
122580 
122640 
122700 
122760 
122820 
122880 
122940 
123000 
123060 
123120 
123180 
123240 
123300 
123360 
123420 
123480 
123540 
123600 
123660 
123720 
123780 
123840 
123900 
123960 
124020 
124080 
124140 
124200 
124260 
124320 
124380 
124440 
124500 
124560 
124620 
124680 
124740 
124800 
124860 
124920 
124980 
125040 
125100 
125160 
125220 
125280 
125340 
125400 
125460 
125520 
125580 
125640 



WO 01/78894 



PCT/US01/12245 



43/154 



tagtctcagc 
tgcagtgagc 
tcaaaaaaca 
gaaggacagt 
tttttttttt 
tatcttggcc 
ccaggtagct 
agagatggga 
ccacctcggc 
accctacttt 
tttcataatt 
ggtgcaaaag 
tctgccgttg 
cctgcggttg 
ctccctccac 

ggaggttgca 

ctccaggtct 
gctgtgatca 
tgtggccagg 
ctccaggaag 
cctgtcaccc 
ccgctttgtc 
tgaatacact 
caaggccacc 
agacctggtg 
acaagctgac 
cagagggcat 
aacagcaggc 
cagtctcaga 
attgtattct 
atgtgaatat 
acctgtaatc 
aagaccatcc 
gcaggatggt 
cttgagcccc 
ggcaacacag 
cctgatcatg 
aagaggcctc 
tatgcatatg 
tgtgcataca 
tgtatgtgcg 
gttaataaat 
ttttcataag 
gcccaggtgt 
gttcccatgc 
tgtgtccagt 
agcgctcgct 
tggccaagtg 
accagctgta 
tctggttctc 
catagggagt 
tctcagccaa 
agcctcctgg 
cccttcgtgg 
catccaggaa 
tccctgaggc 
ctgaagatgt 
caggtgtaga 



tatttgggag 
catgaacata 
aaaagaaaaa 
gcacataccg 
ttttgagaca 
cactacaact 
gggactacag 
tttcaccatg 
ctcccaaagt 
tctatttggc 
aaaaaacaat 
gaatgatgtg 
ggtcactgca 
agtccactag 
tctccctggc 
tttccaaggg 
cggatctcca 
ctgtccccgg 
ggctcactga 
gctgtcagga 
tcagacaagg 
ccacactgcc 
tttgtccttg 
ctgcaagtgg 
agggacagca 
acagagtgcc 
ggtctcaggc 
ctgcactctg 
ctaacggaga 
ggaccagaaa 
agattatagc 
ccagtacttt 
tggccaacat 
gggtgcctgt 
agaggcggac 
tgagactcca 
atcattgcat 
atgtctgcaa 
tatatagagt 
cacaagtgca 
tgtgtgtgtc 
gggaaatcta 
tttaaacttc 
gcagagctgg 
ccaggacccc 
gccaccccac 
gccatggacg 
cagccggagg 
gcggagatca 
caggatggga 
gctggggggt 
tcagcctcaa 
agtgctctgc 
gggtattgtc 
gcactgggtt 
ctgagctcct 

ggaggctgat 
cgccagcatc 



gctgagatgg 
ccattgcact 
cctggcttga 
aaagttggaa 
gagtctcact 
tccacctcct 
gaactcacca 
ttggccagga 
gctgggatta 
ttcccacatc 
gaatgcttct 
acctgttgtg 
cagcccctgt 
acttctagaa 
tgccagtgcc 
agttggtggc 
ggcgcaggga 
aggtggaggc 
ccacagagca 
ctggagtgag 
gcattccctg 
ctgcgagaag 
gctgggctgg 
ggataaaagc 
cctcagccag 
tcctgacagg 
cattcaagtc 
caagtttcaa 
atgaagagac 
aaaaatggct 
ttagataaca 
gggaggccga 
ggtgaaaccc 
aatcccagct 
attgcagtga 
tctcaaaaaa 
tgtggatatg 
cttcaggaat 
gtccatatat 
cctgtatata 
tgtgtagaca 
ggttaaaggt 
caaatataaa 
gacatgagct 
aacccttcct 
cctggcctct 
ttctgcacct 
gtatgggagt 
ctgggtgcct 
cctgctgggc 
cccagccaac 
tctctgcaca 
atctctttgt 
acctgtctca 
atgccagttg 
ccctattctc 

ggggggtgag 

gctccaggct 



gaggatggct 
ccggcctggg 
tcaattagct 
gaccgtactt 
cttgtcaccc 
gggttcaagc 
ccatgcccag 
tggtctcgat 
caggagtgag 
tgactgctag 

gagggactct 

tttccctttt 
cttttatgtg 
ggaacagact 
catcttacca 
tgagcacttg 
gttgggacca 
caggatatga 
gtgaaggatg 
aggcgggtct 
gataccctga 

ggggtgatcc 

gtacgtcact 
cccacttcaa 
gtgaccaaag 
gcagagcagc 
aggcaaatgt 
cgtcaagaat 
gcaacgaccc 
acaaaagaca 
gtcttctatc 
ggtaggtgga 
tgtctctaca 
actcaggagg 
gccaagatag 
aaaaaaaagt 
taagagaatg 
ataaatatgt 
gtataagtgc 
tgcaagtcta 
gcgagaaaga 
atataatcat 
ataggaggga 
caaggccacc 
tggggctcca 
tacggttgac 
cacagaagta 
gggcatcctc 
cgttgggtgt 
tcacctggac 
ttccagcccc 
tcctacacca 
cctgctcacc 
tggtaaggca 
ctggccctgc 
tggccaatac 
accaaagagc 
gcctggggca 



tgagcccagg 
caacagagca 
accatgccct 
ttcttttttc 
aggctggagt 
gattctcctg 
ctaatttttg 
ctcttgacct 
ccaccgcgcc 
catagagcct 
ccaagtttag 
tcccttgact 
gcgtagccag 
ggagcaggct 
ttggcatgga 
tactccccac 
gaggtaccac 
cccccatgtg 
cccacaagtc 
gcgtggagac 
tacaacccgt 
caaagtgcct 
ctgttgtccc 
agatgagaaa 
tgagcatcat 
atgtccgcgg 
ggagtgaggg 
gacaaggaaa 
aatgcaatat 
gtattaggtc 
agccaggtgt 
tcacgtgagg 
aaaacacaaa 
ctgaggcagg 
caccattgca 
cttctatcaa 
atcttaggaa 
aggtagatac 
acatgtccac 
tatccataca 
ttaagcaaat 
tatttgtctt 
actagggagg 
tccaggtaag 
ccatctggtc 
taccacgctg 
gaagccagta 
cagcaggaca 
gttgcagact 
ctcagccact 
agccccatac 
cccgcctgct 
aggatccccc 
gggctgggac 
cctgccctgt 
gcaaccttcc 
ccacgccgtt 
ggtgcagcac 



agtttgaggc 
acaccctatc 
caggaggagg 
tttttctttt 
gcagtggcgc 
ccttagcctc 
tatttttagt 
cgtgatccac 
cagccagaag 
gctcccagag 
ggtcagggta 
tccaggaagc 
ttagctcagt 
cctcctcagg 
agtccagggt 
tgtcagtttc 
tgaagcgtgg 
acagcaccag 
ccgcctgggt 
gagaggtggg 
gacctctgca 
gagtgcctgc 
agtcagtgtt 
actaatagag 
cagcaatggg 
ccttcccacc 
acctcctcac 
gactgaggaa 
gtgaactgtg 
cactggtaaa 
agtggctcat 
tcaggagttc 
aattagccag 
aggagaatcg 
ctccagcctg 
tgctaatttt 
tttagcggta 
ataaataaca 
atagagtgga 
tttatatgta 
gtgccaaaat 
tctctgcaac 
tcaaagcatt 
tggccttgaa 
ctgctctgac 
acagggcccg 
tcagccctag 
tggttcttgt 
agtgtcactg 
gcaagggcag 
agtccccggg 
gctacagggg 
tgacacccca 
tccacacctg 
ctcccctccg 
caagaactca 
ctcagcttgg 
accagtcttg 
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gtttggaggc gtaccccatc cttgagccac ttaatggaac tgactgcagg gtagctgctg 12918 0 

ttcacctggc aggtgagtgt gaccagctca cctggaagga tgttcctccc cgaggggctg 12924 0 

aggaggatct tcacacccct gggggcatct gcaagtcaca gtagggggta ttgggtaagg 1293 0 0 

tgcttgggga gggcagagga tggcacactt cttcttgccc ccctttaaaa gctcagtcct 1293 6 0 

aaggaagtat gcccagataa aacagcagtc cccttcaatc ctcacccagg gcatgctctg 12942 0 

tctgcccatg tccgtccctt tcccgccccc tgcgcctgat gcattcctat gcccattgag 12948 0 

agctgatcat gtgacgcttg gccagcgtcc agcctacccc accagctgta gttttctgct 12954 0 

tcccaatgct gtccattgct cctgctctat ttgggatgag ctccacacac agggtcatgg 129600 

gtaccccact gctagcttta gtggcctgtg gaagctattg gtaagggacc aactacctag 129660 

tgggaggggg ccaaaggcag catcaaacta gctctgaaaa tagttaccca gtttgtaagc 12972 0 

aagaggccaa caacacaaag aactgcattt ccttaaatct tgttccaaag cctctctctg 12978 0 

tgtattcaag tgttttattc ttattttttt agaaagagga tctggctcag tcagccaagc 12984 0 

tggagtgcag aggcacaatc atagctcact gcagcctgaa actcctgggc tcaagtgatc 12990 0 

ctcctgcctc agcctcccga ggaactggga ctacaggtgc aagccaccac atccagctaa 12996 0 

tttttgtttt ttaatttttt tgtagagaca gggtctcact atgttgccca ggctggtctc 130 02 0 

gaactccttg cctcaagcaa ttctcctgcc ttggcctccc aaagcactgg ggttacaggt 13 008 0 

gtgagccact gtatccggcc tcaagtgttt aatttgtgcc aggcactctt ctaaatcctt 13014 0 

gacctgggtc atctccttta tttgtttttg ttgttgttgt tgttttttgt ttgagacaca 13 0200 

gtctcactct gtcacccagg ctggagtgca gtggcacaat cacaactcaa tacaactcca 130260 

cccccggggt tcaagcaatc ctagtgcctc agcctcctta gtaactggga ttacaggcat 13 032 0 

atgccaccac acccggctaa tttttgtatt tttagtagag acgaggtttc accatgttgg 13 03 80 

ccaagctggt cttgaatccc tggcctcaag tgatccaccc gcctcagcct cccaaagtga 13 044 0 

tgggattaca ggcataagcc accgcgcccg gcccatctcc tttaattttt atcataaatc 13 050 0 

tgtgagacag gaaccatcta ttgttatctt cgtcatagac tgagaaaaca gagccaggca 13 0560 

ggaaagggat aaatcccacc tccccagcat cctccagcac ggggtaaatc ccggggcact 13 0 62 0 

tgctgccaac cctgaagctg catgggagct cctgttcccc aaccctttgt gtctcccttt 130680 

ctctgcccag cctggcctca aggacaagcc ccttcgaagc agtaggaggt tgagggaacc 13 074 0 

atttaacgaa gtccttgggg ctcagcagtg tctctccact tgccctaccc tggaggccca 13 0800 

ggagcagcct ggttttgcat caggagcaag ggttccgttt ctgtgggctg gagaggggct 13 0860 

ggtttctgtc aggagcaaca gatgcgctca gccacaaggg tgtgtcccca gacaactcac 13 092 0 

acttcacttg gaggtgaatc tcgctctgag ccctgtgatt ggccacggag agctggcagc 13 098 0 

gcaggatccg gccgtggtcc tgccaggaca tggccatgtg gagggtctcc aggtggccga 13104 0 

cgccggtggg ctcaaacttc tggctgttga aggtgacaga gcgagcaggg tcctggcctt 13110 0 

gccactgcag tctgacctgc tcctgcaggc atacgtaggg agtggagcag ttgaagtcca 131160 

cctctgtgcc ctcgagaagc tccaccgggg aggcaatggt gggcaccctg ggctcctctg 13122 0 

aggacagaga cagcagtgct caggacccgc ttttgccacc cctgagatcc ctcgccctgg 13128 0 

aaaccccagc tgaggagaga gccctgggga gggggctttt aggggaagga aagacgcttt 13134 0 

ctactccagc cccacttggg tgaatacaag ggagaaccag gcctgggcct acgccggggc 1314 0 0 

tggaacagag gctgagactg gctggggtta gattcaggac aagggctggg gctgagagcc 131460 

aaggggtcca gaagcagctt gggaatccct cccggggggc agccaggcca ccccacttat 131520 

cacctgttac tgtgaccaag gtgcctttca catctgacca gcggttgacc tcactgatct 13158 0 

cgaagcggaa gttgtaggaa ccagagtcct cgggctgcag gtccttcagc agcaggttgc 13164 0 

acaccctgtg ctcggggttc cccatgaact cggtgcggcc gcggaagcgg gcctccacca 13170 0 

gcttggggtc cgccgagtgg ctcaccacct gccgctggcc cgagtagtcg tagtaccaga 131760 

tggccgtgat gccgtcgggc acctccacgt cggcagggaa gctgaagatg caggggataa 13182 0 

gcaggcaaga ccccttcaca ccctgcacgt cctggggacfc ggagacgccc catgaggcct 13188 0 

ggcctggggg aagaacggca gggggacaga ggggagggtg atacaggcct cagggtgcca 131940 

cagagccctg cgacctgccc cagagaaggt gccccagctg ggctcccaaa ttctgccctg 132000 

ccccgagaaa tgcacactta gagcagccct tctcagtgcc ccaggggtca gtccactgcc 132 06 0 

cgaggctgct cagaggtttg gtagggtggc tcgaagacag atggcagctt cctgcccagc 13212 0 

tcctggccac tgcccgattg ggccctccct tgacctcagc aaccaaacat gtgacccagg 132180 

catagtctaa tctgccaagg gctggactta tgaaggctgg accctggaag acaggcacta 13224 0 

ggcccgcaaa gcttacctgc tgggaagaat gaggccagga ggagaagctt gggcaagaag 1323 0 0 

cccatagcag gttcttgtgc tgctcctgtt gcctaagagg gtggtgcgca ctgcgctggc 132360 

tgggctcaca ggggcctcca gggacacctc tgggcacttt agccccagca cctgctagaa 132420 

gtccgagcct gtgtccccac ctcctctgct ggccaaccca ataagagggc agggctctta 13248 0 

aagacctctg agtcagacac cagcagagag ccaggaggcc acgttcccag ctcaggctgt 13254 0 

gcccaggaat gccctcactt ggtggcctgc ctcagaaaag cctgtgtgtc ccttggccct 132600 
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atcccaagtt 
tttctccctc 
gcccagagat 
aggcttgtct 
tgccgaagtg 
tgcccgtgag 
ctccgcagac 
ctatctccac 
atgaatttgg 
tctctatttt 
actgaagata 
tgactaagag 
gaggtggcaa 
ccacaaatat 
catggagtgc 
agagatgcca 
atcttcatga 
aaaaaatagg 
gcctgtgaac 
aagacgagcc 
gtggtcccag 
gaggctgcag 
ctgtctgaaa 
ttgggaggct 
ggtgaaaccc 
agtcctagct 
acggtgagcc 
ctcaaaaaaa 
aaaaggagga 
actaatatcc 
catgaaatcc 
atataactgg 
aaatcaaaac 
acaaacaaag 
tgtattgctg 
tcaaaaagtc 
aaaagtactg 
actaaaaggt 
ggtatgtaca 
gaaccttgaa 
ggttccaatt 
tagaggtaac 
agataatgaa 
atgtcatggg 
caccacatac 
ggccaagtga 
gcccatccgc 
ggaagggggt 
gcagaaggca 
gacaccgctc 
atttatggag 
tctgggaaga 
ctggggccct 
gggtcagatg 
gctatcttgt 
acctgatgga 
gaccctttcc 
ctaaccccaa 



ctgctttccc 
ttccccactt 
gcccagttac 
cctaccacct 
ctgcctttaa 
atgtgaaaag 
ccactctggg 
cacagataat 
cttccagctg 
ccttgccttg 
tgtgccccat 
aggggtgagg 
ggcaagtcta 
gagctcaatg 
ccctggtcag 
gaaacacata 
ccttggatta 
taagttggac 
ccaacacttt 
tgggcaatac 
ctactctgga 
tgagccatga 
agaacaaaca 
gaggcatgca 
cgtctctact 
actcgggagg 
aggatcacgc 
aaaaaaaaaa 
caacccacag 
agaatatata 
aactccaaaa 
ctgataagga 
caaaccacag 
caacacagaa 
gtgggaatat 
aacataggat 
aaaacaggga 
ggaaacaggt 
tacaatggaa 
aacattatgc 
acatgaaata 
caggggctgc 
aaagttctgg 
gtgtacactt 
acacccatgt 
acgaggaggc 
ctcaggggct 
gagccgaggc 
gaaggtgccc 
cccaggggac 
gcccaagggc 
tgcaccccca 
gggacctgga 
tgtccctggc 
cccagctccc 
agaatggtgg 
aggaccactc 
aggggtcagc 



agcccctcaa 
ccctgccccc 
tcactccaag 
gccttactaa 
cagtcactcc 
tcctcatttc 
ggcagttctg 
ctttgcctgc 
gaacttctcc 
ttttttacag 
agcagtgcgg 
tctttagggg 
ggtttgaaag 
tgtgcggagt 
ggagtgtcta 
taagtgaaac 
ggcaatgctt 
ttcatcaaaa 
gggaggctga 
tgcaagactc 
ggctgaggtg 
ttgcaccact 
acagctgggt 
gatcacaagg 
aaaaatacaa 
ctgaggcagg 
cactgtactc 
aaaaggatat 
aatgggagaa 
aagaattcct 
atgggcaaaa 
catgaaaaga 
tgagatacca 
aacaataaat 
aaaatggttc 
taccatatga 
ctctaacaga 
caagtgtcca 
tatcaatcgg 
tcagtgaaac 
tctagactac 
agggaggggg 
aaatagtggt 
aaagacagtt 
tgccaacttt 
tggaagtggg 
gggctcagca 
ccagggccag 
cagcaggggg 
ccagccctct 
tcatctccag 
gcctccacac 
cacctcaccg 
cacaggggat 
ctgttcctcc 
ggttctggac 
ccatcacata 
cccaccggaa 
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ggataccacc 
tccccaccac 
acacccagct 
ctatagacca 
cctgctcaaa 
ctgcccccag 
tctgctcaag 
tgaaacctta 
tttgaggttc 
cttgtgagag 
agcctaccag 
ttcagagctg 
gtagcatgag 
gtaccatatc 
aggggtacca 
tataaaactc 
tcttaagtat 
tttaaaactt 
ggcaggagga 
tgtctctacg 
ggaagatcgc 
gcactccagc 
gcggtggctc 
tcaagagatt 
aattagctgg 
agaatcacct 
cagcctggtg 
gaatcaaagg 
aatatttgta 
acaatgaaca 
cacttgaata 
tgctcaacat 
cttcacatcc 
attggtgagg 
cgtcactgtg 
tccagcaatt 
gtacaccaat 
tcagtgaatg 
ccataaggag 
cagccagaca 
gtatattcag 
agctaatgtt 
gatggttaca 
aaaacagtaa 
gcaatcctcc 
gacaggaaac 
gattccaatg 
tcaggctgac 
cacagtacag 
cgcaggctgc 
ttctctagga 
caggttcttg 
tgaagaaaag 
gtcagggtca 
catttggggt 
acagcgaccc 
atgtagaggc 
tccagcctat 



ctaaggcaga 
ccaaaccaac 
agaatgatgg 
taatggggct 
aaccttctgt 
ctctgtcccc 
ggctccctag 
ttcaaccttc 
ctgtagctac 
cccatttctc 
gcattcagca 
aggttcagag 
agcgctgtgg 
caaaaaggca 
tagacctgag 
ttagaagaaa 
gaaattaaaa 
ggccagacac 
tcactttagc 
aaaaattaaa 
ttgagcccag 
ctgggtgaca 
acgcctgtaa 
gagaccatcc 
gtgtggtggt 
gaacccagga 
acagagtgag 
acactatcca 
aatcatttat 
acaacaacca 
aacatttctt 
cactaggcat 
gttagaatgg 
atgcgaagtt 
gaaaacaatt 
ccactcctag 
gttcacggca 
gatgtggata 
gaatgaattc 
caaaagggca 
agactgaaag 
taatgattgc 
caatattgca 
attttacatt 
ctggtcctaa 
tcatgacctc 
actaccaggg 
caggtgggac 
ggcgggattg 
tggagtggac 
agccctaggc 
gccactggag 
cagcctgctg 
gctctgctat 
cctgaaaagg 
tggaacaggg 
cacctatgct 
tggctcagcc 



tgaacagctg 
aggaactgga 
tttcttcctg 
ttactgaact 
ggctccccat 
attccctgct 
ctgcccagct 
aaggagcagc 
cccagagcta 
aggacctaga 
aaccccttag 
ttggagtggg 
aacacatacc 
ggccctcaac 
cccaaaagga 
acaggtgaaa 
gcacgaaaaa 
agtggctcat 
ccaggagttc 
aaacaggcct 
gggaggggtc 
gagcaagacc 
tcccagcact 
tggccaacat 
gtgcacctct 
ggcggaggtc 
actcttctgt 
gagagtgaat 
ctgataaagg 
ccaaaaaaac 
caaagaagat 
taggaaatgc 
ctattcacaa 
gaaattcttg 
gggtcattcc 
gtatataccc 
gcactattcc 
aacaaactgt 
taacatatgt 
aatattgtag 
tagaatagga 
tgagtctctc 
aatgtaccta 
atgtatattt 
atgctgactt 
ccagctccca 
gtcacacctg 
ttagcctgct 
ggacaggaag 
tgatctggcc 
ctcctcctct 
aatgatatag 
ggcacactgg 
ggccagggca 
gcaatcgtga 
cgcgggggag 
tagcccggcc 
tgtcaccaca 
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aaggccagct 
aggagatctc 
cccccgcaac 
gggggaattg 
tgcagccctg 
ggaggttcct 
taaatcccca 
cacagtgggc 
ggttggctgg 
gaaagggcag 
tttcctgaca 
ggtgacatgg 
aattcctgga 
aattctcaga 
atttcacact 
aaactgtgct 
gtggaattgt 
gtcagggtga 
attgaggttg 
ttaagatacg 
ttgggatgat 
gtgaggtgag 
gagcagggtg 
gaagggtttc 
caccactcgc 
ttcagcccag 
gggtacctgc 
aggaaggacg 
gagaaggatc 
acccagaggc 
ctccataggg 
ggaccgctgc 
ttcaccagcc 
gaagggggct 
ctggggtggg 
ggagcctccc 
ggggacataa 
tcctgtacgt 
ttgcttttga 
gctcacggca 
gctgggatta 
gggtttcacc 
tcggcctccc 
accttcaccc 
attactgcag 
atgttaattg 
gtttgtttgt 
tggtgcaatc 
ctcctgcttc 
ttttttaagt 
ctgggctcaa 
caccactcct 
ctgcctacct 
caaggccacg 
cgcaggccat 
tacagagata 
.tgcttgggcc 
tcaacagctg 



tcagcccaga 
tgtccctgtt 
ttgggttgca 
cgggtcccgg 
aaccatccct 
gaggttcata 
ctcctactat 
actcacgttt 
ggccccaagt 
aggcagtttc 
ccaaggcttc 
ccagctggag 
gcagattatg 
gccatagaat 
tcattcctta 
ggacagcacc 
gctggggtta 
tgatcaggtt 
agtgatgggt 
gtggaggctg 
ggatggttgg 
ggtgagataa 
atggtcggga 
agttgagtcc 
acccagatgg 
ccccgctcct 
tccacagaag 
gccaagagct 
ccgtatcagg 
tggtcaggga 
gaaggccctc 
aacgctggag 
aaatccacac 
tcctgcacac 
gatgcactac 
tgtaccactg 
ggccccaatg 
tttttggttt 
gatggagtct 
acctccacct 
caggcatgca 
atgttgtcca 
aaagttctgg 
ccaccttggg 
cccatcttat 
ctccaagcaa 
ttttgttttt 
ttggctcact 
agcctcctaa 
tattttgtag 
gtgactctcc 
ggccattttt 
gtactggtgg 
tgtgcaggtg 
ggtgccccaa 
ttaggtctct 
atttcctctt 
gtccggagac 



taactgttct 
tgaaagcctg 
gctgtggcct 
gaaagggggc 
cccttgggtg 
cttgggtctc 
tggtatgtgt 
gctgcctgtt 
ccctgacctc 
cctatggaca 
tccccaactt 
gttactggcc 
gtcaaaatag 
ccataactaa 
atctggctct 
agaagccctc 
gagtaaggag 
ggaattctgg 

gggggtaagg 

ggttgagatg 
gattagatga 
ggttggaata 
ctgggattgg 
atctttgatt 
agcccccccc 
aatgctccac 
gaagtggctg 
cctggtgcag 
ggctgcttcc 
ggcagctgca 
ggcaaggcca 
cctccccagg 
cccactctct 
age t etc etc 
acgatgccca 
agtactctta 
tccaggacac 
ttggcttctt 
cactctgtcg 
cccgagttca 
ccaccacatc 
ggttggtctt 
gattacagag 
acaggaaagg 
tgagggecta 
tcccagatac 
tgagacagtg 
ttggctcact 
gaagccggga 
agaeggagtc 
cacctcgacc 
ttttttaatt 
ggagctggcc 
taaaaggatg 
agcagatgtc 
atcactctgt 
teggecaaaa 
acccagctct 



ggaaacagaa 
gagttgaggg 
agtgagcacg 
ggtgtgccag 
actcttttgg 
caagtcttgg 
tgcatcaggg 
tgagaccaag 
tgattccaga 
gctaggcccg 
ccccattggg 
cacagatagg 
tggaaatccc 
tgcttattgg 
aaatgettet 
ctgtggacgg 
agattatcgt 
aagcaaatta 
gtgaaggttg 
aggatgatag 
gatgagtaaa 
ggggctgggg 
gatgeaggtt 
agtgtcttgg 
cgacccctgc 
tcaccctctg 
eggeggtget 
ctggctcccc 
tctttcccaa 
ggccttgcgc 
gggacttagg 
caaagtgaga 
ctgagcccct 
cctgacacct 
gaccaaggca 
agaaccccca 
tcctcaggtt 
gttgttggtt 
tccaggctgg 
agtgatcatc 
eggctaattt 
gaactcctga 
gtgagecget 
aaagccccca 
gtttgtgcag 
aagcaacaag 
tettgetctg 
gcagcttcga 
ttacaggege 
tccctatgtt 
tcccaaagtg 
tcacaactca 
acagcaaatt 
tgaaacccta 
tggtgacatt 
tcctctttat 
cacaaagggt 
aggcctgtgg 



agagcaggga 
gacagtgccc 
cagcgccccc 
caacagggag 
aaatcattgt 
gtgctctgaa 
tggcttgagc 
tgcctcagga 
ggccaagttt 
ggtgtaggat 
ctagagaagg 
gagtcagggt 
caatcacagg 
ctttgacatg 
ggctggagtg 
acgaagtggt 
tgggctgggc 
aggctgggat 
gagttgggtg 
agtcggagtt 
tggttagggt 
ctggggctga 
ggaaggattt 
acttgggttg 
ccctatcccg 
ggcccaggga 
ggacctgegg 
agggctctgc 
agcctcagct 
aatgccaaaa 
gactccagca 
aatggagtgg 
tagggaggct 
gagagggagg 
gttattctcc 
agaggcagtc 
ttctctttcc 
tttttctttt 
agtgcggtgg 
ttgcctcagc 
ttgtattttt 
ccccaagtaa 
gcacccggcc 
ccacagctag 
gcacttgaca 
actttttttg 
tcgcccaggc 
aattctaggc 
ttgecactat 
cccaggctgg 
ctggaactac 
aacttaattc 
aggcccctgt 
aatcagggtg 
ccaccctgaa 
agctcctgtg 
tcatcccatt 
ggttgtgaca 



ccgctcagaa 
cgccccccgc 

tggtggtcga 

caggcagctc 
tccccagaca 
gacaggattt 
tggcctggta 
ggtcttggag 
agctggggaa 
tcagtttctg 
aagaacacag 
aeggatggge 
ccaaatgttt 
ggcagtagaa 
ctcactctcc 
gatggatgaa 
tggggctgag 
tggggtggac 
caggtgatgg 
gtggttaggg 
tggggtgcaa 
ggtagggtca 

gggggtggtg 

gtttggggtc 
ctcagccagt 
ccagggacag 
agaggagaac 
eggctcaaga 
ctactgtcca 
egggaaagae 
agcagaagtg 
gggactccca 

gggggaggtg 

cgcgcccagc 
aagecattea 
ccgtgttgtg 
cctctcactg 
tgttttgttt 
cacaatctca 
ctctcgagta 
ggtagagacg 
tctgcctgcc 
actatcctat 
ttactgttac 
tgtgaagcag 
gtttttgttt 
tggggtgt cc 
ttaagcaatc 
geaeggctaa 
tctcaaacac 
aggegtaage 
aactcagtcc 
accctgaggg 
gatccagaat 
atgetcagge 
tccactccta 
acttcctccc 
catgggtacc 
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aatccttcag 
cccttgactt 
cacaaactaa 
tgccatgcaa 
cagatgacct 
caccagacca 
tggctacatt 
ttacctttcc 
tgcttcctgt 
taggctgggc 
gatcgcttga 
cgaaaaatac 
aggctgaggc 
cactactgca 
tgtaatttct 
tttttttttt 
tgcaatggca 
gcctcagcct 
gtattttttt 
agttcccagt 
agccaccata 
tagagaattt 
tagaaaacac 
gtaagtgttg 
tattttttgg 
tgcgctccag 
ccatgccctc 
taccagccac 
agtgaactat 
taacaacaac 
tagaagctag 
gtgaatcctg 
agcaaccagg 
gcccctgccc 
cagagccctc 
cagtctatag 
tgggagtcat 
caggcatatc 
gtcctctgga 
aatgttggct 
ctccacccga 
cctcacccag 
cacccacttc 
ctccctcctc 
aagccagaga 
aagacccatg 
acacatgcag 
gccggctggg 
gaatggcgtg 
agcctgggcc 
gcacggaagc 
gaggtcagaa 
acaaaaatta 
cttgaaccca 
ggtgacagag 
cacaccaaca 
tccaatacca 
gatggctggg 



tccactggga 
tgaccttagt 
gtcttgaaac 

ggggcaggct 

gaatcagacc 
gtcaacctgt 
agcaagatga 
ttagcatgag 
catttccttg 
acggtggctc 
ggccaggagt 
aaaaattagc 
acgagaatcg 
ctccagcctg 
cagtaggtca 
ttttttttgg 
caatctcggc 
cccaagttgc 
gtagagatgg 
caagtgatcc 
cccagtgaat 
gccaaagaca 
caaataacgg 
gcatccaaga 
ctcagtcatt 
ccactgtatc 
tactttaaga 
tcaggaggct 
aattgtgtca 
aaaaaaggta 
aagttagtca 
ggggcccagt 
accttaattc 
acctccttgc 
tgcttcctca 
aggcccaagg 
gtagctcact 
catcacctgc 
aagcttgtat 
ctgctgttgc 
ggtgacctct 
tctgtcatct 
tgtccctgag 
aacacaacca 
cttccaggga 
tacatgtcct 
tccccaaagg 
cgcggtggct 
aacccgggag 
acagagacag 
ttatgcctgt 
gtttgagacc 
gccaggcatc 
ggaggtggcg 
caagactcca 
tctagttcaa 
ctgagagcac 
ggatgggaag 



ctctatacat 
catatgactt 
gtacttaagc 
ccaactaacc 
catggattga 
ggatgcgtga 
tacaaggcct 
gagagcaaag 
agtatacagc 
acgcctgtaa 
tcaagaccag 
tgggcctggt 
attgaacccg 
gatgacagag 
cactgttaca 
agatagggtc 
tcactgcaac 
tgggactaca 
ggcttcacca 
acccaccttg 
tattctcgtt 
aaactttttg 
tgttttaaat 
tgatatgaca 
gtcaatgggc 
tgcattcggg 
taactcctta 
gaagcaggag 
ctgcattcca 
attctttaaa 
catggtctta 
gctaagaagt 
agtcattcat 
cagtatctct 
gccttcagac 
gagatctttg 
ggctaaaagc 
tctacgatcc 
gaagagttcc 
ttctcttatt 
actctcatag 
gtccctctgg 
tgtttcaagg 
caaggtctag 
aagtcaacaa 
cccgttttat 
acaccgtctg 
cacgcctgta 
gcggagcttg 
actctgtctc 
aatcccagca 
agcctggcca 
tgtaatccca 
gtcacagtga 
tctaaacaca 
gatggtggac 
agtaagcaaa 
tatccactgc 



ccaccccttg 
gctctggcca 
agtttgcctt 
tgctggtcca 
aacaaaactc 
atttattgct 
aagtcccaga 
acttgtctac 
aaactaggta 
tctcagcact 
cctggtcaac 
ggcaggcgcc 
ggaggtggag 
cgagactctg 
cattcatcta 
tcactgtcac 
tttgacctcc 
ggtgagtacc 
tgttgcccag 
gcctcccaaa 
ccagatagga 
gttgaaaaaa 
aaaatagaag 
actccacaat 
tgcttccagt 
aaagcaggat 
aattgctgat 
gttcacttga 
gcctgagtga 
ttgtacactt 
agtctttatt 
agagggagat 
tccttggacc 
gcagagcctc 
ccacttgctg 
agacccatac 
ctaggtcctg 
caggagtcac 
cagggcctgg 
gggtatctga 
ccacaccctg 
ctctcctctc 
ctctggggtt 
gtggaatgac 
gcccaaggat 
tccctgctca 
tacagagtca 
gtcccagcta 
cagtgagccg 
aaaaaaaaaa 
ctttgggagg 
acatggtgaa 
gctactcagg 
gccaagattg 
cacacgcaca 
tgagaacttg 
gggaaaagga 
acaggatttt 



gcttcatggt 
atggatgtgc 
tccccctgaa 
aagaggatga 
agctgagccc 
ggatgctgct 
acaacacacc 
cttgattagt 
aataaataaa 
ttaggaggcc 
gtggcgaaac 
tgtagtctca 
attgcagtga 
cctcaaaata 
ataacaatta 
ccaggctgga 
taggctcaag 
accagacgca 
gctggtctca 
gtgctgggat 
aaaccaagtc 
aataagtttt 
tgtttcaccc 
catgaaacta 
cattgtccaa 
ggaggaaagg 
gtggtgggtc 
acccaggagc 
cagagtgaga 
tattgctgct 
ctaacaggtg 
gaagccctgc 
tacccagatt 
ctgcctcccc 
tgtctcaggt 
tgtagtcagc 
catagagctg 
tcaatgactc 
aacatcaaaa 
aggccgtact 
gaccctcact 
ctgttcatgc 
ttcagagaca 
ccactaaggg 
gcccgttaca 
gggtctgggc 
gatggagtta 
ctcgggaggc 
agatcgcgcc 
aaaaagaata 
ctgaggtggg 
accccatctc 
aggctgaggc 
tgccagtgca 
cgcacacata 
tctctgccat 
gacagaaggg 
gatttaattc 



gggaaagcat 

acggacatga 

tttctgtcat 

cgaacacatg 

agcctacgtc 

gagaattttg 

cagaacttgc 

cagggagcac 

aataactagg 

gaggtgggca 

cctgtctcta 

gctactcagg 

gccgagatca 

ttttaaaaaa 

ttctttttct 

caggctggag 

tgatcctcct 

gccaattttt 

cactcctgag 

tacaggtatg 

atagggaggt 

gctacaagta 

tctccctcaa 

gatccctttt 

agtggctgct 

acaaagtagg 

actcctgcaa 

ttgaggctgc 

tcttgtctct 

tatattccac 

atgtgcccga 

cctcctgtgg 

ccagcctcag 

cagagggtgc 

ggggacaact 

ttggggcaga 

ggttcaaatc 

agaagctaga 

gagctctgag 

ctcctctctc 

cattcagtca 

atcttgtact 

ctgaagggac 

accggctctg 

agaaagttaa' 

atacagtgga 

agaacattta 

tgaggcagga 

gctgcactcc 

tttagtctgg 

cggataacct 

tactaaaaat 

aggaaaatca 

ctccagcctg 

tttaaaccac 

tcctggccca 

ctgggaacag 

tagaagatag 
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aaagagggag 
ctgagagaaa 
accccaaagg 
gcagaggaaa 
tgaattggtt 
agaaacaaac 
agatgggtgg 
ctgtctctac 
cccgagtagc 
ttgaactatg 
aacaacaaca 
ggctagaggt 
atcctggggc 
cctagaggta 
gagtgagact 
aataaaaacc 
ttatgacaac 
gctggagtgc 
ttctcctgct 
aatttttgta 
agtatcaaac 
acaggcatga 
tagcaaccct 
atggtacagt 
aacttgaacc 
tatttattta 
tggtggctta 
gccagcagtt 
ctggagtgca 
cctcccacct 
atcgaaaaaa 
aactcctggg 
tgagccaccg 
caatcccgga 
tagtgccaat 
tcttggaaat 
acatctgtgg 
tttcctgcfcg 
cacgagagaa 
actgagggca 
tggatcaagc 
tcaaattcca 
gagctaccat 
aaaaaaccca 
aaggaatgct 
cacaggaagg 
ttcttctctc 
ccaggaccaa 
aagaaaagag 
ctttagggta 
ttctgcagta 
aacctgaaga 
aaactttgac 
tagcactttc 
ccaacatagt 
gcacctgtag 
cggagtttgt 
ctccatctca 



gatcacgttc 
gctgccccag 
. gttggtagct 
taccctaggt 
accaggtatt 
aggccaggcg 
atcacctgag 
taaaaataca 
tgggaggctg 
attatgtcac 
aaaaaggtaa 
tagtcacatg 
cccagtgcta 
gaggttgcag 
ccatctcaaa 
tacagcaaag 
tttttttttt 
agtggctcaa 
tcagactccc 
tttttagtag 
tcctgacctc 
gccaccacac 
ctgagctaga 
aaaataacct 
caggcaacct 
tttatttatt 
cgcctgtaat 
cgagaccagc 
atggtgcaat 
cagcctccca 
aaaaattata 
cttaagcaat 
tgcctgacca 
gggaacagat 
atgttcataa 
ttaaaataga 
ttttggtttc 
tattctcaag 
gcatgttgct 
gatgaaagat 
cttacctgaa 
tagaaggact 
caagggaaac 
agtcacatca 
ggataaacac 
tgggcacagc 
agaaagaagt 
atagaaagaa 
aatccatttg 
gaggaaatga 
aatgacgttt 
taaagcatgg 
catgaaaatg 
ggaggccgag 
gaaaccccat 
tcccagctac 
ggtgagccaa 
aaaaaaagaa 



aggaacagat 
aggccggagc 
ggcacacgta 
gagttccagt 
accagacagg 
cattggctca 
gtcaggagtt 
aaaattagct 
aagtaggagg 
tacattccag 
ttctttaaat 
gtctcatgtc 
agaagtaggg 
agagcggaga 
aaataaaaaa 
aacaaagagc 
tttttttttt 
tctcagctca 
gagtacctgg 
agaccgggtt 
aggtgatcca 
ccatccagtg 
tgctattgtt 
tgaccgaggt 
gtttcctgag 
tacttactta 
cccaccactt 
ctggacaaca 
catggttcac 
agtagctggg 
tagagatggg 
ccacccacct 
atacttcctc 
aattcaagga 
aacaaaaaaa 
attgctgaaa 
caaaattagt 
ccctgtgggt 
gctcacagtt 
atatactgac 
gctaaaggga 
ccaattatcc 
tgccagacct 
ttgaactgaa 
aacaactact 
aagtagtctt 
aagaaaggca 
aacacatctg 
accttgacat 
tactatgttt 
atacaatcat 
aagatttcat 
taaaaaagag 
gcgggtggat 
ctctactaaa 
atgggaggct 
gatcacgccg 
aagaaaagaa 



gcgggtaaag 
agaagtggac 
cctctccacc 
atcaatcaac 
tgaggaagac 
cgcctgtaat 
caagaccagc 
gggtgtggtg 
ttcacttgaa 
cctgagtgac 
tgcacactac 
tttattctaa 
ggagatgaag 
tcatgccact 
taaataaata 
tattgcaccg 
ttggagatgg 
ctgcaacctc 
tattataggc 
acaacatggt 
cccgcctcag 
ttctgacaat 
atctccactt 
ttacatggct 
ttggactctg 
cttatttatt 
tgggaggctg 
aggagacagg 
tgcagcttcg 
accataggca 
ggtctcacta 
tgccctccca 
tttaataaac 
atacaagaaa 
agcagctatg 
taaaaagaaa 
tcactcttct 
tggtggaatt 
cattatgagg 
cctgggtcct 
tgagtgcttc 
ttgcctcagt 
tggtgacaag 
agagctgcag 
tcatctcact 
tctggttgac 
attggagaac 
aagcatgatt 
ctgaaacagc 
ccttctcaaa 
aacatccgtt 
tataattacc 
cccaggcgca 
cacaaggtca 
aatacaaaca 
gaggcaggag 
cggcactaca 
aagaaaagta 



gaaaccagag 
tctctacagg 
cccacatgta 
ctgccctttg 
caacacaaag 
cccagcactc 
cttgccaaca 
gggttctcct 
cccaggagct 
agagtgagat 
attgctgctt 
caggtgatgt 
ccctgccctc 
gcactccagc 
aataaataaa 
ttactgtggg 
agtatttttc 
tgcctcctgg 
acatcccacc 
ttcaccatgt 
cctcccaaag 
tttatactta 
acaggaaact 
ggtaagtggc 
aactactcag 
gagatggggt 
aggggaacag 
gtcttgctct 
acctcccagg 
cccactatca 
tattgcctag 
aagtactaga 
taaacaaaaa 
acttatgaaa 
aaaagaaagc 
ttcaatagaa 
ttgggtaaaa 
gacacctctt 
agccagcagg 
tgtggacatc 
tcactgttaa 
tgtgtgtcca 
cccacccact 
ttccccgaaa 
gcacctgcaa 
ataacaaaga 
cagaaacgac 
ttgaacaatt 
aaatattctc 
ggagctatct 
ggttttcaat 
gaacagcatg 
gtggctcacg 
ggagtttgag 
ttagctaggc 
aatcacttga 
gcctggtcaa 
gacagcagaa 



ccaaagcacg 
gactcaatac 
acactgcagg 
ttcataaaga 
agaaagatcc 

tgggaggcca 

tggtgaaacc 

gcctcagcct 

cgaggctgca 

cttgtctctt 

atattccact 

gcccgagtaa 

ctgcttgaac 

ctgggtgaca 

taaataaata 

cctggcagta 

tgtcacccag 

gttcaagcga 

acacccggct 

tggccagtct 

tgccaggatt 

ttaactcatt 

gagtcacaga 

agagaaatga 

ccatactgcc 

cagccgggca 

atcacttgag 

gtcacccagg 

ctcaagcgat 

tgtccagcta 

actggtctct 

attacaggtg 

gtgaacaaac 

agaaagattc 

aaaaaataat - 

aagggtgtga 

ataccctgat 

ccatttactg 

taaagacaac 

attgagtccc 

tagccactga 

gccctcggat 

gaaaatgggg 

gacaggagag 

agaatccagg 

caacatctgt 

aaaatctatt 

agtggggtgt 

catcaaggca 

ggttacgtgg 

tttcagaatc 

caaatgttac 

cctgtaatcc 

accagcttgg 

atggtggcac 

acccaggagg 

cagagcaaga 

attagaggga 
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gatggcaggt 
agaaattctt 
ctataagtag 
gacatgggcg 
agtagatgct 
agtgatgttc 
tttgggaggc 
catgatgaag 
tggtgcacac 
tgagagtgag 
actccagaat 
ttgcattttg 
gtcatatcca 
tgagcaatca 
aaatttatat 
cattgattct 
gggactaaga 
cttatacaca 
atttgaaaaa 
agttaggtag 
gggcttccag 
gatttttgat 
tatttgaaat 
gtatacatgg 
agtggctcat 
tcaggagttc 
aaatcagctg 
gagaattgct 
ccagcctggg 
aaagaataag 
acccaaaata 
aaaatggaca 
gggaggccga 
agtgaaaccg 
gtcccagcta 
gctaagatcg 
aaaaaatggg 
aagcatatga 
gtaccactaa 
gtgaggatgt 
ccatttttaa 
taacaattac 
cacagtggct 
aggccaggag 
cgctctgtca 
cccggcttca 
cgccaccatg 
tagactggtc 
gggattacag 
caacacagtg 
cacacacttg 
agtttgaggt 
agagaaagaa 
acgaaaagaa 
agaaagaaag 
ccatacaaaa 
gaagcaacta 
aatggagtat 



gacctggggc 
tctaaaattg 
caccaacaca 
gggagatggg 
ctcttgagtt 
aaactccaga 
cgaggtgggc 
ccctatctct 
ctgtagtccc 
accctctctc 
atataaaaat 
tttcatctct 
cagcccccaa 
ttcctgcctc 
cacccagatt 
ccacttcacc 
gacactacag 
gatttttaaa 
actcacaaac 
gtcaggaatg 
tcaacagaag 
gatgaggagg 
ctgctggtag 
agaataaata 
acctgtaatc 
gaga cc age c 
ggcgtggtgg 
tgaacctggg 
cgaaagagtg 
aaaacaagee 
cacaaagaac 
aacgggccag 
gcagggcaga 
tctctactga 
cttgggtgtc 
cgccactgca 
caaaagatgt 
aaatatgett 
acccctatta 
ggagtgacag 
gacagtttgg 
actcctagat 
cacacctgta 
ttctttcttt 
cccaggctgg 
agegattetc 
cccagctaat 
ttgaactcct 
gtgtgagcta 
atactctgtc 
tagtgtcagc 
tgcagtgagc 
agagagagag 
aagaaggaag 
aaaggaaaga 
ccctgcgcaa 
agaagctctc 
cattaagtaa 



tgggggaggc 
atgtatgagg 
ggactccagt 
aatgggtgat 
actaaattga 
aaaggctggg 
agatcactta 
actaaaaaaa 
agttattcag 
aaaaaaaaaa 
gaaaagaggg 
ctgtatatgt 
ccatccacca 
cttccagttc 
cccttccagc 
aaactggtca 
ctgaccctta 
aaaataaaca 
cacatagcta 
cataaatata 
gctattagta 
attagtgece 
ctggaattgg 
aataaataaa 
ccaacacttt 
tggecaacat 
tgtgcacctg 
agtcagaggt 
ggactctgtc 
acagattggg 
tcttaaaacc 
gcatggtggc 
tcaccttagg 
aaatacaaaa 
tgcggcagga 
ctccagcctg 
gaacagacac 
aacctgatgt 
aaatggtcaa 
gactctcatt 
cagtttctta 
agttacccaa 
atcccatcac 
ttatttattt 
agtgcagtgg 
ctgtctcggc 
ttttatattt 
gacctcaagt 
ctgtgctcag 
tctacaaaaa 
tactggggag 
cacgattatg 
agagagagag 
gaagggaagg 
aataaaaaag 
gaatgtttat 
caatgggtga 
taaaaagaag 



acaatttttc 
aacgaaattt 
aatcacacag 
ttccattccc 
gaaacaaaga 
cgcggtggct 
ageccaggag 
aaaaaaatac 
gatctcacta 
aaaaaaaaaa 
eggaagagtg 
tgaacttttt 
ggcatcaact 
gtcttcctgt 
taccttaatg 
ataattgtct 
aacaacatgg 
tattgaaaaa 
gaaatatcaa 
tatgttaatt 
attaagtatt 
ccaaccccca 
gcaactaaat 
taaataaata 
gggaggccaa 
ggtgaaatac 
taatcccagc 
tgcagtgagc 
tcaacaacaa 
agaatatatt 
caacactaag 
ggtggctcac 
tcaggagttc 
attagecgae 
aaatagcttg 
ggcaacacag 
ctcatcaaag 
cattaaggaa 
aatccagaat 
cgttgctggt 
caaaactaaa 
ttgaggtgaa 
tttgggaggc 
atttatttat 
agtgatctca 
ctctgagtag 
ttagtagaga 
gacctgcctg 
ccaaggccag 
aaaatttttt 
gctgaggctg 
cctctgcact 
agaggaagga 
aagggaggga 
gaaaagaaaa 
agetgettta 
aaggataaac 
acttagcaaa 



ttacatagtg 
taaatataga 
caaacaaaac 
tgctttaatg 
taaagtatat 
catacctgta 
tttgagacca 
aaaaaattag 
ctgtactcca 
aaaagaagaa 
gaactgcggg 
ttttaaccta 
gctgtgttta 
actgeagagg 
ctaccacctc 
cttctaaaaa 
gtttgaactg 
aattttggag 
aaaaaaaaaa 
ggctgtttat 
tgagtcaaaa 
tgttgttcaa 
tcacgagatg 
agtatttgtt 
ggcaggcaga 
cgtctctact 
tgcttgggag 
egagattgea 
caacaaaaaa 
tgcaaaacaa 
aaatcaaaca 
atctgtaatg 
tcaactagcc 
cgtggtgacg 
aaccagggag 
tgagactccg 
aagatataca 
ttacaaattg 
gctaaaaaca 
ggaaatgtaa 
catagtctta 
aacttatatc 
caaagaggaa 
tttcttttga 
tctcactgca 
ctgctgggat 
cagggtttca 
cctcggtctc 
gagttccaga 
taattagtca 
aggatcactt 
ccagcgtggg 
aggaaggaag 
aggaagggag 
gagagagaga 
ttcataattg 
aaactgtggt 
ccacaaaaag 



atagggctca 
ttgtttagaa 
tgagaaaaca 
atgaggtgtc 
tgtttagagg 
atcctggcac 
gectaggcaa 
caaagcatgg 
gcctgggtaa 
gaagaagaaa 
gtagaaaatt 
catgtactac 
tatggagggt 
ctagaaaact 
attcagecat 
tattagaatt 
tgctagtctg 
agatgtgaca 
actgaaacag 
attattagta 
gttatacaca 
gggttagcta 
tggaacggtc 
ggcctggtgc 
tgacttgagg 
aaaaatacaa 
gctgagggat 
ccactgcact 
gcaattgtta 
atatctgatg 
accccattaa 
ccagcacttt 
tggecaacat 
tgcatctgtg 
gttgcaatga 
tctcaaaaaa 
gatggaagat 
aagcaacaag 
ecaaatgetg 
aatggtatgg 
ccatacaatc 
cagggecgag 
ggattgcttg 
gacagagttt 
atctctgcct 
tacaggtgea 
ccatgtcagc 
ecaaagtget 
ccagtcctgg 
catgtggtgg 
gagccccagg 
caacagagta 
gagaaaagaa 
gaaggaagga 
aaccttatat 
ccaaaaactg 
atatctgtgc 
taatatgeat 
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actaaatatg catttagtaa tattaatatg cattaagcaa taaaaaaaga cgtgtcaagc 15 0060 

cacaaaaagt aatatccata ttactaaatg aaagaagcca gtctgaaaag gctacgtgct 15012 0 

gtgtgatttc aactatttga ctttctggaa aaggcaaaac tacagacagt aaaaagatca 150180 

agggttgccg ggggttctgg ggacaggaaa ggatgaatag gtggagcacg ggaattttag 15 0240 

gtcagtgata ttattctata tgatactata atggtggatc catgatatgc ttttgtcaaa 1503 0 0 

acccatagaa agtacaacac aaagagtgat tcttaatgta aactatgtgc tttagtcaac 150360 

aatgtattga tattagttca ttaattttaa caatgtacca cactaatcca agatgttaat 150420 

aatggggaaa ctgcgtgtga gggaggggat acatgggaac tctgtactac agctcaataa 150480 

ttttataaat ctaaaacttt ttttttaaat aaagtctctt agaaaacaat aaaaataaaa 15 054 0 

taaaaagtca attttttttt ttttttttga gagggaatct tgctctgtca cccaggctgg 150600 

agtgcagtgg cacaatctct gctcactgca acctctgcct ccttagttca agcgattctc 150660 

ctgcctctac ctcccaggtt caatcaattc tcccacctca gcctcccaag tagctgggac 150720 

tacaggcatg cgccaccacg cctggctaat gtttgtattt ttagtagaga tggggtttcg 150780 

ccatgttggc caggctggtc aagaactcct gacctcaggt gatccgcccg cctcagcctt 15 084 0 

tcaaagtgct gggattacag ggttgaacca catgcctggt ctaaaatgtc catttttaaa 150900 

aggcagtctg gctggggcag tggctcacgc ctgtaatccc agcaccatgg gagaaggctg 150960 

agcagggtgt atctcttgag cccaggagtt cgaggctgta gtgtgctatg atggtgccac 15102 0 

tgcactccag cctgggtgac agagtgagac tctgtctcta aaataaataa ataaaaatac 15108 0 

aataaaattt taaaaggcaa tctgcagcaa gagaagatga agttagcagg agagcaggac 15114 0 

agaggagagt ccatgaacaa cagggagagg aaagtggtca tggtggcagc tcccaggctc 1512 00 

ctggcatatg ttcaattccc tgttcccagc cctcaggaag cccagatgtc cccacctgcc 1512 6 0 

cctacataca aaccctggat ccttgacatc aacttctcct ttccttcatg tgctctaatg 15132 0 

agtttttgtt acttgtggaa catttaacta acatcattaa ccaagtggac ctgtctcaga 151380 

gtggtgttat gagaagacag cagccaaaag acagctgcag ccaagcacag tggctcatgc 15144 0 

ctgtaatcct agcattttgg gaggccgagg tgggtggatc acctgaggtc aggagttcga 15150 0 

gaccagcatg gccaacatgg cgaaaccccc tctccactaa aaatataaaa attatccggg 151560 

tgtggtggcg agcgcctata atcccagcta cttgggaggt tgaggcagga gaattgcctg 15162 0 

agcccagggg gcagaggcgg cagtgagccg ggatcgtgcc acttcactcc agcctgggtg 15168 0 

aaagagcaaa actctgtctc aaaaaaaaaa aaaaaaaaga cagctgcaac aaatgtcaag 15174 0 

ttctgtgtgt tttcttttct tttctttttt ttctatttaa ttaatttatt ttagagtcag 15180 0 

agcctcccta tgtcacccag gctggagtgc agtggcacag tcacagctca ctgtagcctc 15186 0 

aacctcctgg gctcaggcga tccttccacc tcagcctcct tcctagctgg gactacaggt 151920 

gtgtgccacg acatctggct tgtgtgtttt cttttctttt tttttgagac ggagtcttgc 151980 

tctgccaccc aggctggagt gcagtggcgc gatcttggct cactgcaacc tctgcctcct 152040 

gggctcaagc aattctcctg cctccgcctc ctgagtagct gggaatacag gcgcacacca 15210 0 

ccatgcccag ctaatttttg tatttttagt agagacgggg tttcaccatg ttggccagga 152160 

tggtctcgat ctcctgacct tgtgatccac ctgactcggc atcccaaagt gctgggatta 152220 

caggcgtgag ccactgcacc ctgcctggct tgcatgtttt ctgacatact gtcaaaagga 152280 

tactcatact aaatggcaac acattctcaa gccccttcct tttcttctcc tatctgcttt 152340 

accacacacc atagctgctt ttaccgtttt ctccttaaaa actcaaaaaa accttccccc 15240 0 

aacctatcta tccacttctt gtcctcccct caaacccact ccattttgat tctgcatcat 152460 

aaaaaccaag cagagttctg gggaggcaga agcctcctct tatgatattg gaagggggtg 15252 0 

gatgaaattg agcacacaaa gccagaatcc tcttttgtgg aaagatgggg gcagtgggca 152 580 

gagggaagca ggctcatctc tttccctctc ttcccttccc ttctctttca caccacacgc 152640 

tccgcctggg tgagctcatc gtcctcgtgt cttcaatttc cacctccagg agatcgactg 152700 

ccaaatttct acccgcaatc ccaaactcca ctctgagctc cagacccatc tactaattgc 152760 

ctagtcaaca ttttctctgt gggatcgaat cagataggat tatattctgc tgtgactaac 152820 

agagactgaa aattcagagt cctaaacaaa agactttctt tttcttgtgc tgaaaagtct 152 880 

ggaagtacac agccctaggc tggaataggg atactgctgg ggggctccgc ctccttgcag 152940 

gctgcccctc tgctatctcc agggtgtgtc cctcaaccac actgtccaaa gtggtgactg 153 000 

aaggaccagt cctcccactg acattccagg caacagaatg gagaagaaca ggctgaggaa 153 060 

ggaggaacca caggcgccct ccagctgttt caaggaaggt tcctggaaga tgctaaataa 153120 

caattccgtt tatatctcat tagccaggct tagtcacgtg gcaacaccta gaagcaagaa 153180 

aggctgggaa atatactgtt tctgctgggt taccgtatgc atgactaaaa agagagcgct 153240 

ctgttcttag gaaaatgaga atagactggg ttgggggagg ataaccagca ttggctacac 1533 00 

tgcagcccca caacctcctc atgcttgatg tggctgaaat aaaaagagca tctcccccta 153 360 

acatttcctt tcccttagtc ttgttccttt tctgtacccc ataccctagc agaggggcaa 15342 0 

ttctgtgggg tgggctcact ggcaagggga ggtctcaccc aagctcagtc aagggtgagc 153480 
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agagagaatt 
caaacagaag 
atgtcagcag 
ccagcccttc 
caccctttgc 
atgtggtata 
ctccacctgt 
gtcagtacag 
ttgggacagg 
gaaggaggag 
cttgaaccct 
cccatttcca 
gctgggtggg 
agggacacta 

aggtgggccc 
cccagcctcc 
gtgacctgtt 
ccccaccatg 
cagcacacct 
cgtccctcct 
gtgggtgcat 
gcgaatgtgt 
cttgggcagg 
tgctaacagg 
aaactgcctg 
ctcctgcctt 
gactccccaa 
gcctgaggta 
gttcagaagc 
ggctggaaaa 
gtcatgggga 
tgaaagtcac 
tctcatgaga 
gctgctcctg 
agatgcagga 
agggagacgg 
ccacctggac 
tgctatcctt 
atcagactga 
ggtttggagg 
ctcagagcct 
cagggaaacc 
accctgcaga 
cagcctcaaa 
acaaagacac 
gcacacacag 
gcaaagtctt 
acagacggac 
ccatgttggc 
aaggaggccc 
ctgtagctgg 
aaaatagccc 
ctcatttgat 

ggtggactga 

caggcagaac 
attcatccgt 
cggtggtgtg 
cagcctcctg 



gtacaaggaa 
atgtccaggt 
agtaggacac 
aaccggctgg 
acttttgtct 
ctctcagcca 
ctgctgggaa 
aggtgtggga 
gtattgggga 
gaaccagctg 
gggccaccag 
gggaggaaga 
gcaggcagag 
gagacagggg 
gggtgaccag 
tgagctgtgg 
ctctctaagc 
gttgtggtga 
gaggttcagt 
gaggcaggtg 
gcatgtgtgt 
gtgtgtgcgc 
agtccggggg 
ctgaggtggg 
ctggtctgcc 
ccaaacccag 
gcagacacag 
gggatggatg 
acagaaggca 
agagtttggg 
gccacggaga 
ctctggttgc 
ggggaaatcc 
gggaggacct 
agctggtgag 
gagggcacag 
aattgaatcg 
ctctcatgtc 
ggggggccga 
gcgctggcac 
gctgccgtgg 
aggtggaagg 
ccccaggaca 

cgaggtgcag 

aagatgcagc 
acacagtcag 
acacatacaa 
acacatagac 
ttggtatcca 
tctggggcag 
ggagcatttt 
ttgcaaaggt 
ggggctgtgg 
gaagcatctg 
gagtcagctg 
tcattcattt 
atcacactta 
agtacctagg 



gctgaatttc 
ggagcaaatg 
aagggcaggt 
ccaccaagca 
gtaccatctc 
cccctcaggc 
ccctcacatc 
ggtaggtcca 
ggttcagcag 
ttgacagagc 
gctcctgcag 
tgatgcagcc 
gaattcctgg 
tggcccagga 
ctgggagcca 
cacaggcaag 
ttcagctttc 
gggtaactga 
caacaaggcc 
gtggaggcac 
gcatgcatgt 
acatgcatgt 
caaggggggg 
ggatgacatg 
acatcgggcc 
gaagtcttga 
cacccacaca 
tggacaagca 
ggacgtgccc 
aggtcagcca 
gtataggagt 
agatagagat 
ctgaccacag 
actcagaggc 
gtcagtgggt 
agaggtagag 
agcttcctct 
agactggatg 
aatgtggggg 
tgaggacagg 
ccctcagaga 
gaaaaagaaa 
tgcacagcag 
gcctgcaggc 
aacacacaca 
acaatccaag 
agacacactt 
gcacactggg 
gaattgttgg 
tggctctgag 

gg ct ggg ca g 

ttggagtgga 
ggcccttctc 
gaacttctgt 
cctttgaaag 
atttatgaca 
ctgcagcctt 
actacagacg 



cagctggtct 
cagctgagag 
ctctactgaa 
catcggctcc 
ctccaccagg 
tcagatccag 
acactgggag 
atcctggctg 
gaaggtggga 
agcaacagca 
gcaactggga 
tgaggcaatt 
aaggaagctc 
gctgcccacc 
cccctgggtg 
agcaccggcc 
tcatctgtag 
agtgtcaggg 
cactggtgaa 
atgccctctc 
gtgtgtgcat 
gtgctaagct 
acatgcaaag 
tcttagaaac 
aagctgtggg 
aatccccatg 
agggtctaag 
gctcagggct 
aaggggccgg 
agggcagtcc 
ggtgatggga 
ggactggggg 
tccagaggaa 
tgattcttcc 
gggcccacaa 
ggcagggact 
agtgtggaat 
cggcccaagg 
tgggggaagg 
ggatacgccc 
cgagcccatc 
aaaggggcag 
acacacactc 
cccaaccaag 
cgtaatacag 
aaacacactc 
aagcacagac 
aggcggcctg 
tgcagcgcac 
aagaaaggca 
agaggaggaa 
agagcaaaag 
tgacgggcag 
gggggcacat 

ctccacgagg 
ggttctctct 
aaactcctgg 
tgagctacta 



ggaggaaaca 
ttcttgtcca 
agcaccaggg 
actctgcctc 
agtgcccttc 
agccacctcc 
acgaagccaa 
gacctggctt 
cctcaaggct 
gggagccacc 
gcagctcagc 
ggccacctgg 
acccctcagc 
tcctatcccc 
ctcaggtacc 
cacggggtct 
aaagagggtg 
taggggac cc 
gaaaacctcg 
cattgtgtgt 
gtgtgtgcat 
ctggagaaca 
ttctctccac 
aaacttcacc 
tacttggaaa 
gaggggacct 
ggaccccttt 
ggggctggcc 
acatggggtt 
acgccttggc 
tggccccaga 
cagggctgag 
aacagcgggc 
tcgtgctacc 
ccatgctgga 
ctgaggaccc 
gtgggctcac 
gtggggtctg 
agggagcttc 
cccatgagtc 
tttggagctg 
gcatgagtgt 
caatccctcc 
gacccctgtt 
aaacacagtc 
acacatgaac 
acaaacacac 
tggagactgc 
tccccatgcc 
gcaagcctag 
ggagagcatt 
ggatgtccca 
ccttcccagg 
ggggcagagg 
gacctagcag 
ctgtcaccca 
gctcaagcaa 
cacctggcta 



ctgaaaaccc 

aggggacaga 

gcaaagtcac 
ctgcccgtgc 
ccgctcctcc 
tccaagaagt 
aagtgggggg 
tgggtgccat 
ttctttttat 
aaggctgagc 
ctctcacagt 
aggaagttca 
actggggtag 
atgaataagc 
atgcttctgg 
gcatggggca 
aagagtcctt 
agtggggcac 
cccaccctcc 
gcatgagtgt 
gcatgtgtgt 
agcagccagg 
agcccctcag 
ctctgcccta 
gagggccctg 

ggggccaggg 
ctccccaacc 
taggtggaag 
ggaggctagg 
tcttctctgg 
gttgtgcctt 
aggaagggca 
ctggggagag 
ccacccaggg 
gaggagcagg 
tcatccatgc 
tggggctatt 
tgactccctc 
caaacaggaa 
ttcacccctg 
gagttgggga 
gaggggccag 
tggctctgct 
acacacagac 
acacactagc 
agacatataa 
caacagatgc 
tgattggata 
tgacctcatt 
ggaggtgaca 
tcagggaggg 
agaactgtgc 
cacaaaggga 
ggaaggggaa 
aaattcattc 
agctggagtg 
tcctccaccc 
atttttaaaa 
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ctttttagag 
caagccatcc 
cccagcccag 
tccccagaag 
gtgccccctg 
gcgcctccgt 
gcctgccctc 
tgtaggggca 
gggcagggaa 
gaaggggccg 
gcagctggac 
cccctctccc 
tcagggaggg 
aggccctgcc 
gcggcgcggc 
ggcggaaggc 
ccaccgagct 
ggaggtgagt 
tgccgtcccg 
acagatctgc 
cgaccttggt 
tcagaggctt 
ggggcctgct 
gggagttgag 
ccgaagcgca 
ttccgggcca 
cctatctgtg 
agggaccctc 
ctgtgggcag 
acccaagggg 
gaggccattt 
cacatggtcc 
gccgtttctg 
ctgtggcctc 
gagcagagag 
tgggtgggct 
tcccatagtg 
ctccgaggat 
ccagcacttt 
tgagccgtga 
aaaaggagcg 
tgggatggga 
gactaatcgg 
actagaggtc 
ggctggggcc 
atcaaactgt 
ggcccctgaa 
gctttcggga 

ggtgtgtgtg 
ccatgggatc 
cactctgtca 
tccaggctca 
ctaccacgct 
ctggtctcga 
ttacaggcat 
tccttaaaga 
gaaaattcag 
gcttttatca 



acagggtctc 
tcccacctca 
aaagagacct 
ggccccggtfc 
gtgcggagtg 
cgtctgaact 
atccgcaatt 
gtaggggtgg 
gtggaggggt 
aggctgagcc 
cgcaggaggc 
ttctcccacc 
gccctggacg 
gctggcctcg 
catccggcct 
ctcagggccg 

gggggcccgg 

tctggggcgg 
ggctgtcggc 
aagcccctct 
ctcccacctg 
aggctggaga 
ccagggagcc 
tgtatgtgtg 
acagagtact 
gcttgaggtc 
aggttagcgt 
cggagagcag 
agcatgggag 
gtgcggcagg 
ctgtcactcc 
catcttctta 
atctgtgctt 
acttagcccg 
cagcaggcgg 
ctttgtctac 
ctcctgggga 
tctgtcaggc 
gggaggatga 
tcgtgccact 
ggagtggggg 
gtctctggtt 
ccctgcctat 
aatatgaagc 
agagtttggc 
cccactcaat 
gcggggctca 
gctgggcttg 
ctcgcttggc 
ctccttcttt 
cccaggctgg 
agcaattctc 

gggctaatat 

actcctgagc 
gagccaccac 
actcatgatg 
aatctgcatc 
ataccttgca 



cctttgttgc 
gactcccaaa 
tcaccccaca 
aggacaagaa 
gtccccttgc 
tgagcaaaga 
ctgcctcttt 
gaggagagag 
ctagggcggg 
ccaaggtggg 
cgaccagggt 
ccctgggctg 
cgggaccagg 
cacatctggg 
gggggagggg 
ggcgcacgtt 
gatcgcggcg 
gggctgccgg 
tgagacccca 
ctgcagcgtg 
ggcttcgggg 
aactcctgtg 
aaactgaggg 
tgtgttgggg 
gagtggccca 
caggtggggt 
gtgttgatga 
acgcagcgca 
gggctctgtt 
atgtatggaa 
ctgccccagc 
agggctcctc 
tgggccctgg 
ctgacggagt 
aagaggcagg 
acaggcatcg 
catgagcgcc 
aaagggttgg 
agtgggagga 
gcactccagc 
cagggttcgt 
gtgagttcct 
tgggtagaag 
caagagttag 
atgtgcttgt 
tactctcaaa 
gcctcccctc 
tcccgtgttc 
agtcagccac 
aacacaattt 
agtgcagtgg 
ccacctcagc 
ttgtattttt 
tcaagcaatc 
acccagccct 
actagaatgc 
atgaagggaa 
tatctggact 



ccaggctggt 
gcgctggatt 
aaccccctgt 
aaaggctgtg 
cgggcatggg 
cagagctagg 
tctggccact 
ggttccgggt 
cccttccccc 
cctgtggggg 
ctggaactcc 
ctaggggagc 
ctgggcccct 
ccgggagcgc 
gcggcggccg 
gagggctgcg 
gctggacggg 
gcgccccgag 
ccaaaaacct 
gctgcgggct 
gaccacctcc 
tcttaggggc 
acccggtgcc 
ggtgcctcag 
ggtgccaggg 
ggggtccagg 
gagagactgg 
ggaatcccag 
gtttgtgaac 
agcaaagatg 
gcaggatggg 
tccagaacag 
ttggcccagt 
cctgttccat 
gtcctagtga 
tgtggagccc 
cagctgaaac 
gcccctgcaa 
tcgcttgagg 
ctaggcaaca 
cctaggctat 
gctgggccac 
tcagggccag 
gggtcagtcc 
agccaggaaa 
cccataggca 
attttcagct 
tgtggtcagc 
ctcttggctt 
tttttttttt 
tgtgatcatg 
gtccaaagtg 
cgtagagatg 
cacccacttt 
taacacaaat 
gattctcagt 
agaacactgc 
tgttaacata 



ctttgttgcc 
acaggcatga 
ctggaggcca 
tgcctcggag 
gcctcagtct 
cctagcctgg 
gaggaccggg 
caagggactg 
tccgctgttg 
cggggcgggg 
tcttcccctc 
gggggaggcg 
cggcggaggc 
gggccgaccc 
gggtgggccg 
accgccgcag 
gctggagctg 
tggagaaagg 
ccagactttg 
cgggaaggca 
cctgtccctc 
tggcaggact 
taggtctgag 
gggtcccagg 
tctgaggctg 
ttcccgctag 
aagggagtga 
gccagagggg 
tggctgctcc 
aggacaggga 
gcagccacct 
agtggggcag 
ctgtggcttg 
tacagggctg 

ggaggggagc 

tgaaacccag 
accccacaca 

ggtggctcac 

ccaggtggtc 
gagcaagacc 
tgtaagacca 
accctatcct 
gggtcagggt 
atgggcaggg 
tggagttcct 
ccaccagtct 
gtggactctg 
actctcctta 
acctccacct 
ttfcttttgga 
acttactgca 
gctgggactc 
gggttttgct 
gacctcccaa 
tgatttagtt 
aagaccaaaa 
tcaaacctgc 
agccctctgt 



cagccagctt 
gccacgggtt 
agtcccagca 
gagaggctgg 
gtgagactcc 
ggacccgagg 
ctagggaagc 
gaggggtctg 
ggccctcgct 
gagcccctca 
cctccgccgc 
caggaggggc 
ccgcgcaggc 
ggcggcgcag 
gccccaagaa 
cgggcacggc 
tcgggagggc 
cgaggaggtt 
tctggtgggg 
cttggggcag 
gcacagctgc 
ggtgtgggct 
gtggaaacct 
ccctgtggtc 
ggacctggtc 
ggcagtggag 
tcacgaaact 
taatgggggg 
catgatgagg 

ggggccatgg 

cactcagcca 
ccctttgggt 
gaagaaacgc 
gctgggtgct 
tggaatgacc 
aacactgcat 
ggaagagatc 
gcctgtaatc 
gaggctgcag 
ctgtctcaaa 
ggggatactc 
ggtccccagt 
gaagctgatg 
ttagggatca 
ttctgtgtaa 
gaaattagta 
cccagaccta 
agtgtccctg 
ctcggggaat 
gacagggtct 
gcctcaacct 
taggcatgca 
gttgcccagg 
agtgctagaa 
cctacctcct 
tatctaacaa 
ccaactccct 
gtcacccatt 



157020 
157080 
157140 
157200 
157260 
157320 
157380 
157440 
157500 
157560 
157620 
157680 
157740 
157800 
157860 
157920 
157980 
158040 
158100 
158160 
158220 
158280 
158340 
158400 
158460 
158520 
158580 
158640 
158700 
158760 
158820 
158880 
158940 
159000 
159060 
159120 
159180 
159240 
159300 
159360 
159420 
159480 
159540 
159600 
159660 
159720 
159780 
159840 
159900 
159960 
160020 
160080 
160140 
160200 
160260 
160320 
160380 
160440 
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ctacaggaaa 
aataggctct 
gaga egg tag 
gatgtggaat 
gtctcactct 
tgcttcccag 
gtctgccact 
ggccaggctg 
gctgggatta 
ctaggttgtt 
acaaagtggt 
gctgaagaat 
cagtagtatg 
atggtgtttg 
aatgtgatgt 
ggtgggaccc 
ctggctccta 
agetgetata 
atccaactcc 
atgeaggaga 
ggctggtttg 
gccagctctg 
gagaaggaat 
accaatgaga 
cttctcaaac 
cggtggctta 
gecaggagtt 
aaaattagee 
ccccagctac 
agtgagctga 
tgcctcaaaa 
catgttctga 
caggtgatac 
tttgaaatat 
catcttagaa 
gccctcacat 
gcgtcactac 
agctcatgtc 
agtgtttcca 
acaatcttca 
cagectttaa 
caacgaaggc 
gggeggatea 
ctctactaaa 
tgaggaggct 
gatcgtg'cca 
aagacaaatt 
gttgcccttc 
gcccagccca 
tttgaccttc 
ctccatcaga 
gcacctggca 
aagccatttt 
tgttaatttc 
tgttggagag 
caaaggactc 
agggacaaat 
gatgttggct 



gggcactgag 
tgccatccct 
ataactggac 
ccagtgtaga 
tctcgcccag 
gttcaaacga 
aegcttgget 
gtctcgaatt 
caggagtgag 
tctcctccag 
tccccagtgc 
cagaacaaca 
ctggttaatg 
ccaatttctg 
cattgaatgc 
agctcagcat 
agaaccctcc 
tgcatgccca 
aatggtacag 
atcctggaca 
tgtgaactgt 
tgtctgcctg 
gcattggccc 
acatagtgtc 
ttcaatgtgc 
cgcctgtaat 
cgagaccagc 
aggcattttt 
ttgggaggct 
gaactgagat 
ataaataaat 
ttcagcaggt 
caatgetget 
cctgcctata 
aataaatatt 
gtattcctga 
actccagcct 
acctcttgct 
gtctcatctg 
tgagaccatg 
atgtaggttc 
caggcaeggt 
cctgaggtca 
aatacaaaaa 
gaggtaggag 
ctgcattcca 
gagtcatgee 
atataaaatc 
gccctctgtc 
ctagcactca 
ccgttggttt 
cagagcaggc 
ttgacacaag 
attagacatg 
atgtattaaa 
cagecaaaag 
aaatcccact 
gtcceggaga 



gcacagagat 
gaaaccccag 
aagcttccga 
gegaggaetc 
gctggagtgc 
ttctcctgcc 
aatttttgta 
cctgacctca 
ccacggcgcc 
gtacaatagc 
ttagcacaga 
tacaaataag 
gtttccaatc 
tcttgtaaat 
agaattggga 
cccactccat 
cctccccctc 
caaccagcac 
aggacagcaa 
tacaagagtg 
gtgagcaagg 
tgttctatag 
tgegtagect 
agtggtctgc 
acctgaatca 
cccagcactt 
ctggccaata 
aggtggcatt 
gaggcaggat 
cacgccactg 
aaataaataa 
ctgggctggg 
gatcctcagc 
gcctggcccc 
cttcctcacc 
tttcagccaa 
gggcaacaga 
cactcctcac 
gggacctttt 
aggcatctca 
tcctcgtgac 
ggctcatgcc 
ggagttcgag 
ttagctgggc 
aatggcttga 
gcctgggcaa 
cccaccccac 
aaatctgaaa 
cagacttacc 
geaaegcttg 
ctagggcagg 
accagaaaac 
ctgggaactg 
agaatggtgt 
gagttcaegg 
taaacaaata 
ctgcttgtct 
tgggectgea 



gtggcatgat 
cataggggca 
tagctgggaa 
agtctttttt 
aatggcgcga 
tcagcctcct 
tttttagtag 
ggtgatccac 
cggcccagag 
ctcggtccag 
gcctggcatg 
gctgccagct 
agctctggaa 
actcccagcc 
agagatagaa 
catcctgagc 
tctgtgcttt 
ccatccttct 
gagtgtctgg 
tgtgggagtc 
gtgtgtgcat 
caaaactttc 
cctcctttgt 
agactggctg 
cctgggaatc 
tgggaggctg 

tggtgaaacc 
ttaggcaaat 
aatcacttga 
cactccagcc 
ataaataaat 
gectgaggtt 
ccacactttg 
agaacttcat 
tcccttcagc 
gcttttcaga 
gtgaaactcc 
ctaagcttac 
acagggatct 
gtctcctggg 
agagtgacac 
tgtaatccca 
aacagectgg 
atggaggcac 
acctgggagg 
cagagtgaga 
cctgccacct 
tagggtgeae 
atgtcctctt 
tcatttcaaa 
gacagtgtct 
atttagaagg 
tgggacttta 
tgegatgaca 
gtgaaacaac 
aaataaataa 
gttcctgttg 
ggggctgtac 



gcactcacgc 
ctcagcatat 
gagtataggc 
tttttttttc 
tcttggctca 
gagtagctgg 
agacaaggtt 
ctgcctcagc 
tctttttttt 
aaaatccatt 
aagaatctcg 
cctccctcga 
gaaggataaa 
atgactgatt 
agaatcagct 
tgggctctgg 
cccttttctc 
tctgattctg 
aaatctccat 
caggtgtctg 
gtgtctatgg 
agattgggee 
ttagaggagc 
agagacactg 
ttgttaaaat 
agaegggegg 
ccgactctac 
taggtggcgc 
acccaggaga 
tgggggacag 
aaataaataa 
ctgeatttet 
agtaacaaag 
agacaggatc 
tttagcgccc 
aattctagct 
atcttaagaa 
ctcccttggg 
agctttaccc 
ceggtatata 
cctatctcaa 
gcactttgag 
ccaacatggc 
tcacctgtaa 
cagaggttgc 
ccctgtctca 
caaacctttc 
aggccctgcc 
cccaccccgc 
atcacttgcc 
ctctgggtca 
atcctgattc 
gatgagattt 
ttgtttaaaa 
atgatggctg 
gcattcagat 
acagctacag 
ateggtaaga 



taggtgecat 
cgcacaatga 
ccatggtttg 
ttgagatggg 
ctgcaacctc 
aattacaggc 
tcaccatgtt 
ctctgaaagt 
tctggtggtt 
cattcattca 
ataagtgttt 
gaccttgatc 
atcctgattt 
tgaagctacc 
egcatgaget 
gctcctgccc 
ctcaacagcc 
attcctgagc 
ggtgctggcc 
tccacaatgg 
geatgecaga 
gctggtgaca 
agccaggccc 
t tgagcaaca 
aggecagacg 
atcacctgag 
taaaaataca 
atgcctgtgg 
tggagattgc 
agtgagactc 
atatgaaata 
aacaagcttc 
ctgttgatgg 
gagcccctcc 
tccaactact 
caatatgatt 
aagaagttct 
ccactaaaac 
atctcttggt 
ttcctctgtt 
aacaaacaaa 
aggecgaggt 
gaaaccccgt 
tcccagctac 
agtgagccaa 
gaaaaaagaa 
aagatttget 
tgcagggtaa 
attatccaac 
tgattatctc 
ctatctccca 
caagaaagta 
aaggaattac 
egtctttate 
ggatttgett 
gagggaacaa 
ggcctgcaag 
acccccacca 



160500 
160560 
160620 
160680 
160740 
160800 
160860 
160920 
160980 
161040 
161100 
161160 
161220 
161280 
161340 
161400 
161460 
161520 
161580 
161640 
161700 
161760 
161820 
161880 
161940 
162000 
162060 
162120 
162180 
162240 
162300 
162360 
162420 
162480 
162540 
162600 
162660 
162720 
162780 
162840 
162900 
162960 
163020 
163080 
163140 
163200 
163260 
163320 
163380 
163440 
163500 
163560 
163620 
163680 
163740 
163800 
163860 
163920 
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ttctgccctg acccatcacc cactcacccg agcaggcact gagacccacc tacaggttgt 16398 0 

gcaatgtacc tctcccacag agcttcccaa cagctcttct gtgagtgagt gtgtgtgtgt 164 04 0 

gtgtgcatgt gtgcagggca ggtctgtgtt tactggaccc ccctgtgtgt gcagaaaggg 164100 

tctgtgtgca ccggtctgtg tatgtgagta tgtgtgcaca caggactggt ctattcacag 16416 0 

gaccctcatg tgtatatgag tgtgggctga gtgcgtgtgt ttgtgtgtgc cagaagggtc 16422 0 

tgtgcacaca ggagtgtatg catgtgcgtg tgttttgttc aggctctgtc cacaggaccc 16428 0 

tgtgtgagtg tgtcagtgag tgtgtgtctc tgcgggcagg tctctgtgca cgggatctgt 16434 0 

gtacgggagt atagtctgca ggtgggcatg catacagaga gttctgcgtg tgcatggagg 164400 

gtctgtgtgc acaggagtgt ttgaatgtcc atgtgttttg ttcaggccat gtgcacagga 164460 

ccccgtgtga gtgcatcagt gagtgtgtgt ctctgcgggc aggtctctgt gcacaggatc 16452 0 

tgtgtacggg agtatagtct gcaggtgggc acgcatgcag agagttctgc gtgtgcatgt 16458 0 

gcgtctggcc atagcctggt ctgggagcag gcttgggttc tccgcagggg tgggaggccg 164 64 0 

gaggcaagag ggacactcct caggaagtcc attcacaaag acctggtggg ctgggccctt 16470 0 

ccttcctgcc tcccttccca agctcgagcc acagcaactt ggcctctcgg gctgcagctg 164760 

gtggcagggc tggagcctgc cctggctgcc tgaggagagg gagcccttga aatgtggcca 16482 0 

ctttttccct gcccctccag cacctcagac tctggccctg tccaggcgct tcccagagtc 164880 

cccctcatgc ctggcccttc gcctgtgttc ccaggttacc cccaaggcca gggagagtgg 164 94 0 

agacctgcct ggcagcacgt gacaggactg tggctgaggg agcacagggc ctgcctggct 165000 

cagcctaagg cgagccagac ctgcctgatt tgtccatgga gcttcctacc agatctcact 165060 

ctccctccct agccagcctc cctggcctgt ccatggatat caaggctgcc ccagtactcc 16512 0 

tcacatgccc cttagcaatg cccagtgtgg tggtggcagc taccagccag gcaggccctc 165180 

ctgccccata agccgcctga tcctcggggc gaggcctggg gctgtcctcc agggctactc 16524 0 

tactccctcc caccaaaggt ccagctcagt cccaggcgtg caagagggtc ttgatggggc 1653 0 0 

gtcctgccct gaaatgctgg ctcagggtgt gtgtatgtgt gtgtgtgggg ggtgggaatc 165360 

agacttccta atgttgtgga cacccccagt tcaggactgg gctatctcta tgggtgtggg 16542 0 

aggaggtgct ccccctcctc gatcctcctc aagcagccag ctgggcctgt ctgtggccct 165480 

tgtgggcatg ggagctgcag agttccctca agtcctgatt agaaccacgt ggcccaactt 165540 

tagggtggag gggagcaacc ccacctccta gccagttagc tctgtagctc caccttttga 165600 

aggaggaggg cttctctctc ctctactccc tcccaggagc tcacctgccc tatctcccct 165660 

gcccttccat ctctggactg agtctccctg ccacacaatg cttttctgtc tctctcccct 16572 0 

cccacactgt gtgtctgagg ggccctagtg gggagacagt gagctcctgg ggaaagcatg 165780 

ggtcatctgc tagggaaacg ttgcctctga cagcagaccc tcagctcctg cccctgcgtc 165840 

atgtgtgtct ttcctacacc ccgcagtcct cctccccagc agagagccct ttgccttagc 165 90 0 

ccagcaagga ctgatggaga acctttgggt gcctggcttg gggccccgct gccataccta 165960 

ccctgcttgt gccaggatga actgccgtct cttctctgca ggctccagcc cggagcggtc 16602 0 

cccagtgcct agcccacccg gctccccgag gacccaggaa agctgcggca ttgcccccct 166080 

cacaccctcg cagtctccag taagcccaga gcagggacca ggtggtgggt gacctggctg 16614 0 

gtgtggacag ggtcgtgcgt ggcaaagtca tgacagggcc tcagctagga aggaggaggg 1662 0 0 

gatgggggta gcaccatgcc tcttgtcccc gtacactcca gtcatgtgcc ccccagaagg 1662 60 

attggccttg ggtgaccctg gactataaag ggtaacaatt ctactgagaa ggcgagcccc 16632 0 

atgtaactaa ctcccaaccc ccccacacag gaggcaccca cggccattga agctggcctg 166380 

gctgaggcaa gtgttcccct gagcatgtgg cttttctagg tccccagagc actgggcctc 16644 0 

ttgtggacca cagcccacct taatccaagt catatctgaa ggagaggaga tgtgttgcca 16650 0 

aaaaaaaaaa aaaatagtct ggaaaccatt agcttgggct gggcatggtg gctcatacct 166560 

gtaatcccag cactttggga ggccgaggtg ggcagattgc tttgagctca ggagttcaag 16662 0 

aacagcctga gcaacacagc gaaaccccat ctctacaaaa aatacaacaa ttagctgggc 166680 

gtggtggtgc acacctgtaa tcccagctac tcgggaggct gagggtggag aattgcttga 16674 0 

acctgggaag cggaggttgc agttagccaa gatcgcacca ctgcatttca gcctaggtga 1668 0 0 

cagagtgaga ccctgtctca aataaaaagg aaactactag cttgaagtat ccaggacacc 166860 

acctccccac ccccaagctc cctgggtcac cccacctgga tgaccttgcc ctggtgatcc 16692 0 

agcttaggga cccccttatt cttaggggag tccagccctg gcataggaga aggcacacac 166980 

ttctctgggg acatgcacgt gccctctcct ctctgccaca gaatcagaca gtctggtttg 167 04 0 

tgtcactatg gccacaaaga gcaagaggat aaaatatata cactggtcca tgtgctgtaa 167100 

aactgcctgg aggagcaaaa ttgctcccag ctgggcccag actaatggct gcctatgaag 167160 

atggtgtgag gtgggggtgg ggctaaggca gtccagaggg agggtgggca gaggctggaa 167220 

gcaggaaagc aggagcttga gccaaaccct gcctggggct acctgagaga cacgtccaag 167280 

gctcagcctg gagcctggga gggcaaggga gcccgagcag gtgggcaggt ggggtgggcc 16734 0 

aggtccaggg ctggtctgga ccacagtcag tggaggggag tgtcttcccc atgcagagga 1674 0 0 
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agcattgggg 
gcacttgcag 
cattgacttc 
catccacatg 
ttcctcatac 
tgccaccccc 
gggtccttgg 
tttaattttg 
tattttcctt 
tggcacgatc 
agcctcccaa 
gttttttgtt 
gcaatctcag 
tcttgagtag 
tgtattttta 
ttcgggtgat 
cacctggcct 
tgcagtggtg 
acttcagcct 
aaaaaaaaaa 
aattcctggc 
gtgagccacc 
tacaaacata 
gcttcaacac 
aacggcctag 
tttagggggg 
ccttctttga 
gtccaacatc 
taaaacaaga 
tcacacagtt 
ggacctcaag 
gcaggcagta 
gtaagagggg 
acaggtaact 
ttccacatca 
gcatcccaga 
catccaggaa 
attccagcct 
aatacccatc 
ctcacctgga 
gcattctgtt 
ctcttggcta 
atctcacctt 
tctccctgct 
cacccaagaa 
agacccgggc 
cttccacagc 
gcgggactgg 
cagggctcca 
tgggacaaaa 
ggctaaaggg 
gctaatgtgg 
ggggtcgggg 
gaagctccag 
gaggcctcct 
gaggcagtaa 
ttcagggagc 
cccctatttt 



ctgtggggga 
aaacccgagg 
ggcaccacgt 
atgaggtgag 
cttggtccca 
agtctggggc 
cctcaactgg 
aatttttttg 
tttttttttt 
ttagctcatt 
gtagctggga 
tttttttgag 
ctcactgcaa 
ctgggactac 
gtagagacgg 
ccgccagcct 
tttttttttt 
tgatcatagc 
cctgagtagc 
attttttttt 
ctcaagtgat 
atgcccagcc 
tatgtgtttt 
tattttgtag 
tattccatag 
ttttttcctc 
aaacttgtgc 
.tcaaatcttt 
aataaaaata 
gttttgaaag 
gaccttggga 
ggtgaagcta 
cttcccaagg 
gctcccaaaa 
caggaaacca 
tgtcttgctg 
cactgtctct 
cctcccaagc 
tacttctctc 
tggctgcaat 
gctccatggg 
ctttaaatcc 
agcattcttg 
ctttcacatc 
gggagggttg 
gtggcccacc 
tttggctaca 
ctctacttcg 
gacagggagg 
ccaaaacgtg 
agacgtcgga 
gtggggctaa 
cagagctaat 
gaaacggggt 
gcagagccgc 
atggaaagac 
acgcccttca 
cccctcatcc 



tgggggtgtc 
tccgagcccc 
ctagtggcta 
gtcggctggg 
aaagtactgt 
tccccactgg 
agcagctccc 
agatataaca 
tgaggcggag 
gcaacctcca 
atacaggtgt 
acggagtctc 
gctctgcctc 
aggcgaccgc 
ggtttcaccg 
cggcctccca 
ttctttgaga 
tcactgcagc 
taggactaca 
ggagagacag 
tatcccactt 
cacaaatatt 
tttaagtaaa 
atatttcatg 
tacagatgtg 
ctctcatttt 
cgcaatctcc 
tttcttttcc 
aaaacaagaa 
acatgtttca 
cctcaagacc 
caggagacct 
cctggagact 
tacatgggac 
caaaggctct 
atcaacacac 
gtcctgtgga 
tctctcttcc 
catggccact 
atcctcctcc 
agccagtgac 
acgaagacag 
tgttttctcc 
tgtaagtttt 
cactgactgc 
agaagacccc 
ccgcccgcga 
agaagttcaa 
cggggccagc 
tgaggaccgg 
ctccggtgtg 
tagtgaagct 
gtcacatggg 
gattttaaga 
cctgtgtccc 
gatgcccgcc 
ggtgcgctgc 
gaaaccgctc 



cctgctggct 
ccagcaggcc 
tgctttcagc 
ctgagagagt 
caccgagaca 
ggtaaaagtt 
aaccctctgt 
aacataacaa 
tctcactctg 
cctctcgggt 
gcgccactat 
actctgttgt 
ctgggttcat 
caccacaccc 
tgttagccag 
gagtgctagg 
cagggtctcg 
ctcaaactcc 
gacagcacca 
ggtctcactg 
tagcatccca 
tttctaagct 
agtgagatat 
tcagcacata 
tcataattta 
tcactattac 
atggggtaga 
ttttttagta 
aatgaaggtt 
aaaatccatt 
ccagtcctct 
caaagctctc 
ggatggaagg 
ttgagcttca 
gagcacttac 
atggctgagg 
ggtgacagtc 
caccagacac 
gatcgtgccc 
ttggtttcct 
ctttctttct 
aattctgaat 
tgctgcctat 
tgcacatgct 
cctgtgcctc 
gacctgcctg 
ttactaccat 
gatgaagatc 
atggaaaagg 
cccgatggag 
ggcggagctc 

ggggttgcag 

gcaagagtgg 
gcgaggtcgt 
tgccaggatc 
ctggaggtgt 
ggccccacct 
ccccatcccg 



gcttgctaat 
tccttctctg 
tttgccagtg 
gaggtgggga 
tggggtctcc 
ggaggatggg 
aggaaaagtc 
taaaatatgc 
tcgcctaggc 
tcaagcgatt 
gcctggctaa 
ccaggctgga 
gccattctcc 
agctaatttt 
gctggtctca 
attacagatg 
ctctgtcaca 
tgggctcaag 
ccacacctgg 
tgttgccagg 
aaaatgctgg 
tgtagacaaa 
acaacttgct 
cgaagctacc 
tccattcccc 
aaaaagtgca 
ttcctagagg 
aaaaggaaaa 
ctaagggctg 
caaatgttgg 
gaatatgtcc 
aacccagact 
gtgaatggag 
tgagactgga 
agccctctag 
ccatgctggg 
acaggaaata 
caagccccta 
aagttcagcc 
ttctccatgc 
aaaggcaaat 
ggccttctgt 
ggagccgacc 
gttccctctg 
cgcaggaaat 
ctgctgactc 
gacctggacc 
cacagcgcca 
gcagggctaa 
tcgtggctga 
agaaatgagg 
gaggggtggg 
gacggtggta 
caggaaatga 
tcaccttgaa 
tcgcccatgc 
ctgccgactg 
tccccgacat 



tctaagtctt 
tggtggtggc 
accctgaggc 
gggtggggag 
ttggaggccc 
ccccaggcct 
agactttgtt 
aaatgttttt 
tggagtgcaa 
ctcctgcctc 
cttttgtggg 
gtgcagtgac 
tgcctcagcc 
tttttttttt 
aattcctgac 
tgagc caeca 
caagctggag 
caatcctccc 
ctaatttaaa 
gctggtcttg 
aattgeagge 
cacacacata 
ttgttttaaa 
actttctttg 
tattgacagt 
gcagtaatat 
tgattcctag 
gaaaaaataa 
agttgacaag 
gttttatgtg 
tagagggtcg 
gggatgggag 
acatagacag 
ctgtccttcc 
gagctgeagg 
gctaaggaca 
actgggggac 
acatctctag 
ccgaccttct 
tcccttccct 
gtggtcatac 
gctcttcatg 
catagctagt 
cctggattcc 
gggagggegg 
eggagggege 
ccgaagaggc 
eggtgagtea 
tgggggtggg 
gagggggegg 
tggaggcggg 
gctaaggaga 
aagaggaggg 
gtgecaaget 
gacccagcta 
cctgcgcttc 
tggcagggac 
tggatgggta 
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gccaccgccg 
agcctgcacc 
ccttcctgtg 
caagccagct 
cgctgccaga 
cagccaagca 
aacactgctc 
gaccacagag 
ctcccaccag 
cccaggccag 
gagtcttttt 
gcgcagtggc 
tgcctcagcc 
ttgtattttt 
ccttgtgatc 
gcccggccaa 
aggcgaggga 
tctagcctca 
cacagagttg 
tgggagggaa 
gaggtagcag 
acattcactt 
tggttcttga 
tacaggtctt 
ccgtggagac 
tggatacaga 
ggacccccaa 
tctacacttt 
caggatggaa 
gcctattttc 
gggtaaaagt 
ccctacaaaa 
gatctcaagt 
tccctgctgt 
taccgtctga 
cacctcctct 
gccagcgctc 
ttgggaaatt 
ttcctgctgg 
cagcgtggct 
atgggtgggg 
cagtgccatc 
caggctggac 
gccgcctcgg 
gccccaggtg 
gagctgcccc 
ctcagccatg 
aggcagaaac 
caacctcgtc 
ttcaccttgc 
tgagggttgg 
ggaagtcagc 
cagccccaag 
tcccaaatgc 
ccaaagacct 
gagacctcct 
tacagggtgg 
tcaggagcac 



gagctcagag 
aagccatact 
atttgattag 
ttcctctcac 
gaaggacact 
gttcatgcgg 
aggaagggcc 
tcattgtgga 
aagagggagt 
tgtagacagt 
tttttttttt 
gtgatctcgg 
tcccgagtag 
agtagagacg 
cgcccgcctc 
gatggagagt 
gggaggcggg 
ggtttagcct 
gggccggctt 
acctctccaa 
ctgtgagcca 
cacttggcat 
gcaggggaaa 
cctaccactg 
agagagggca 
ccctaagctg 
tcctcccata 
tcactggaag 
gcaaggctgt 
ccaccctcct 
ctccattcac 
ccataatata 
gtttttctcc 
atcagattcc 
caccagccta 
gagccatcac 
agctggccgc 
tgaaggggca 
gcaacttctt 
atgaggcagc 
tgtgatggag 
ttaggtgtga 
tagtgtcccg 
tatactgccg 
gtgggcgcct 
cggggacacc 
tctagtatga 
atggctggtg 
tcagggtggg 
cctaggggag 

gggagacaca 

ccactgcatc 
ctcctgctgc 
catctcccag 
acctgcctcc 
gcgttaaagg 
gagtttgggg 
tggctgctcc 



gtcatcttct 
gatgggaggg 
tacctccagt 
tgccccctcc 
gtgcgctggg 
gaggctgcct 
aggcctgtcc 
aagaacttca 
gggcctgcag 
gccttaagtc 
tttttttttg 
ctcactgcaa 
ctgggactac 
gggtttcatc 
ggcctcccaa 
cttatatgtg 
acaggagaag 
c age t agate 
cttgcacatc 
gtgaggecat 
atagcageca 
cagtattttg 
taggactatg 
agatgetcaa 
tagectcaag 
tccaaaaggg 
cctatgggcc 
gagctaaccc 
ccccagccag 
tctcacgctg 
acccttatcc 
cttcccacag 
acccaactgg 
ctgggaacaa 
gtggagaaag 
acacccccac 
ttgtacctgt 
gatcccaggg 
ggcatatttg 
tccaaatttg 
gaggaggtgg 
cagcccacag 
agagaatgea 
caagctgcgc 

gggtgagcgc 

acccacccct 

aggggagagg 

gagectgtet 
acatgcccta 
caaagtctga 
gactacccca 
agctgtcccc 
caggaccagg 
ggcagggaga 
gctcctccaa 
aaggccctgt 
ttcaggattg 
cagtgcccat 



ccagtaccct 
gggecgatte 
teegcagagg 
tgtaccagga 
tgttgacggt 
acctggtgag 
ccatgcttgc 
gcctgctcct 
gaacagggga 

a ggtgtccta 
agacagtctt 
gctctgcctc 
aggtgcccgc 
gtgttagcca 
agtgctggga 
aggtctattg 
gagctatgaa 
ccatggggag 
cgtatcggtc 
tcccattcag 
acactcacag 
agggacaccc 
aaataaaggg 
aggtctagag 
actcattget 
atctcaagcc 
aggcatgacc 
tgagcacagt 
aacaaacacc 
ttccctggct 
ctctccccta 
ggatccccta 
cctgtcttgg 
agttctcctg 
agggttcaat 
attgtgctag 
gaaggggaac 
gttctaggtc 
gccagggcca 
tgcagcacag 
gagccccaca 
tgctgcctga 
gagcagctac 
ctgcaccagc 
cgctccatcg 
ggagggtcag 
gtacccaccc 
gaggaaggga 
cctagggcag 
ggagggcagg 
ggaggaaact 
ttcactcccc 
cacccaggcc 
agecctaagt 
acccctctaa 
gtccataaat 
ccctctccca 
ggaggtcctg 



cctccctttt 
ttccagctct 
gctgaagacc 
gctgagggag 
gcctgccatc 
gaegtgeagg 
atgcacccca 
gatgggagtt 
cagagggaca 
aaagcagagc 
gctctgtcgc 
ccaggttcac 
caccacgccc 
ggatggtttc 
ttacaggcgt 
aagaaggttt 
ggatgtggtc 
ctgcagaggg 
tgtcactggt 
ctgagggcag 
tggcaggcag 
ccaccaccgc 
aggegattet 
caaggagttg 
tggtegtaac 
tgcctgtgga 
cagggecagg 
ggacatccct 
tgctccccca 
gctcaggaaa 
tctggtcaga 
ccctgaagga 
tgcctggaat 
gaagtggggt 
gagctaaaag 
agtctctgcc 
ettgetgeag 
tgcattctgt 
ttccctcccc 

aggggectga 

ggccgcggtt 
ccctgcccac 
tcatcgccct 
tcctggacct 
actccagctt 
agggtcactg 

tggaggagee 

ggcacagccc 
cccaatctca 
actccgccca 
gctggggtac 
tcatcttccc 
aaggacagct 
cctgagtccc 
ccctgatttt 
atttccccac 
gtcaggagca 
ggcaggagga 



tgtctggtag 
gctgggaagt 
accctccctc 
cagagcccat 
tggaaacagc 

cgggcccgag 

ccacccttga 
tgtagagttg 
aaagacacag 
ccaagatgga 
ccaggctgga 
gccattctcc 
ggctaatttt 
aatctcctga 
gagccaccgc 
ctcaggagaa 
tttgctggag 
agaactgtag 
tctggggaaa 
ttatccagag 
ggcacccaga 
ccacctttgt 
agecaatgea 

gggggggctc 
cctagagcct 
caccctggag 
tctgtaaatg 
gtgeaaagat 
gcgtcccctt 
gtctggccct 
atctggggaa 
aataactcca 
ttcagcttcc 
ggcctatggc 
ggctccccac 
aataccccca 
ccccgctatc 
ctagagtctt 
agectggtea 
gaggectaac 
ccccacctca 
cacccatccc 
ggageccgag 
gagtggccgg 
ccgtcagggt 
aagccagaag 
caatttgagc 
tgcccaaggg 
gggtggagat 
gccctgagtc 
tgacaaagga 
cgacacacat 
atggtcatcc 
ctctgagact 
gccatgacct 
agctgttgga 
ggttggagtt 
tgggagttga 
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acgccatagc 
cgggagcagc 
gagctgtggg 
ccctaccggg 
tcctccattc 
ctcacagcgg 
agttgcagag 
aggttcaagc 
cccggggcag 
ccgcccacga 
gcggcggagg 
aggagctcta 
gggctccggc 
cctgctccac 
tgctgtgccg 
cctgggtaga 
caggggcgct 
gccacaacgt 
cagggtttgc 
gggggtctgc 
gcggggagcg 
gaactggggc 
ttctctacac 
gcaaggggag 
tcctctgagt 
caccacgtgc 
cacggacccc 
ccaccaacag 
aagccatgaa 
acctgagctt 
ctatcagtgc 
gaacacctgc 
gccaaggcaa 
tgggttctca 
tgtcttgcac 
cctccacgtc 
aggactcccg 
gtacaggcta 
tcccccgccc 
ccctgctggc 
ccgagtcagc 
tcgtggtccc 
cgccgggcgt 
gctttgtgcc 
gcaccgacgt 
tgcggcgcag 
gctgcgcggc 
tcagcctcga 
gccgcgagat 
acgtcagcac 
ccggcgcggt 
agcgggttgg 
cccgggggag 
attgggcact 
gagaagaggt 
gaacgcaaca 
ggaggagccg 
tcatcgaccg 



tggagcacct 
tgcgaaggtc 
cagagatgca 
caccatatac 
gctgtaccca 
cgcccctaac 
cctctggaac 
acctgaagcc 
ggctggaggc 
gtgtgtgccc 
caccgtggac 
caaggcatct 
cccgccactg 
cccagggggc 
catcttcggc 
tctgaccatc 
caacatctcg 
ggagaccgct 
cgacccggga 
ctgattcatc 
gcgagtgagt 
aagacagaga 
ccccaccccg 
gcccctccca 
gagtagggtg 
agtcggtgcc 
aaactggggc 
cgtgaacttc 
cgagctcttt 
ggtcccccac 
ctagatacct 
gggatgaagt 
aaaacttctg 
aaacgttatc 
acaatgtcta 
gtggtgggag 
tgagctctca 
gggaggggag 
cttctcctct 
acggccggag 
ggtgctgcag 
gcacgacgtg 
ggtgcgggtc 
tgggcgccac 
cttcgagcgc 
ctactgcccg 
agaggatgcg 
gcttgagccc 
ccgcgccgcc 
caatcgctcc 
gccagcgccg 
ggcgggggaa 
ataaggtcat 
ggtccgctga 
ttgcaagaca 
gaagagtcct 
caggaggaag 
actcagcccc 



ccttctaatc 
ccgccacagc 
agcaggtagg 
tgatgggggg 
acctggccgt 
tcccactcct 
catttctccc 
cctcacgtcc 
tgggcgaggc 
gcgctcgccg 
ctgacggtgc 
ggtgagtagc 
ccccctggcg 
ccttatggcg 
gaggacttca 
gccttcgagg 
ctgcccttct 
ctgcgcagga 
atgaccgtgc 
ccacatatac 
gccccagccc 
atcatagcgg 
ggccctgcgg 
ggacctcgta 
gagataaggg 
caggcacttc 
aagcagctgg 
gtgaagtggt 
cagcccaccg 
ccgcccctac 
ccacacatcc 
gcagtggtga 
gtgcctcagt 
acagggataa 
ggtgtttaat 
agaagtgggg 
aagaaagtgc 
gcgccggtgg 
gcccccttca 
gtgcagggtg 
cacgcggtgc 
ggcctcacca 
cgccgctcgc 
ccgcccgaaa 
ttcgtggccg 
gcgcgtcccg 
cgcttcatca 
gccgactgcg 
atgcagtttg 
gtgcgcgcgt 
tctgcccggc 
acgatagttc 
gggagagtgg 
ctgccaggct 
gagcgcgcag 
ggtcfcgaact 
gaggttgtgc 
aaccgggagc 
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tcactccctg 
cgcacgttcc 
gggaaagggg 
aagggcatgt 
cccctcacag 
ccaggggatt 
ccacaccctg 
ctcccccgac 
tggagggggc 
ccgcaggaga 
accagctgga 

caggcggcgc 

gcccggcgag 
cggtgggcgt 
tcgccacctt 
ctcgcaagcg 
ccttcattga 
gcaggtgggt 
actggagggt 
actaagccag 
agcaggctcc 
ggaggcggca 
gcccatgctc 
cctggaacct 
actaacatcc 
tgcctggagg 
gccctgaccg 
cctcacaggg 
tcagcgggat 
atgaacaaac 
atacactgtg 
ttaagagcta 
ttccccattt 
aatgagctga 
acgtgtaaaa 
agcgtgagtg 
tcaaatggct 
ccgcctagtg 
cccgcgtccc 
tgaagctgct 
aggcggcgct 
tcctcaaagg 
cgctcaccta 
agctgctggt 
ccgagcagtc 
gccagcggcg 
ccgaccccgg 
gccaggacac 
gcgacaccga 
ccatcgactt 
cccgccctct 
tgcagtctgc 
gtggggacac 
gaagggaccc 
cccggcaagg 
tggccgagta 
agggtctgga 
caggcagaaa 



ctgtctcctg 
tggtggagtc 
gacggagtgt 
ttgcaaagcc 
tcacccgcac 
ctccgcggac 
cgcccatatg 
cctgcagaca 
gcagggctga 
ccgctacgtg 
gcagccccat 
cccggtaccc 
cgctgacgcc 
ggacctggcc 
caaaaggcaa 
cactgctggc 
cttctaccgc 
cctgagcccg 
cccgggcccc 
cagggcgtcg 
acccacggaa 
atgcctatct 
ctcggtttcc 
ggagcaggct 
cgcaggtcca 
cagaggtaga 
atggatattt 
gatgctccga 
catccagcac 
agatgcagaa 
atgagaccta 
gagggttgat 
ataaaatggg 
agtacctaga 
tgcatatcct 
ttggggaggc 
actttctagt 
gtggcctcag 
cccgtcctgt 
gttcctagtg 
gggcgcccgc 
cgcggtgctg 
tggcgtgggc 
tcgcgacggc 

ggtggccctg 
cgtactcatc 
cgtgcgcaaa 
cgccggcgcg 
aattaaggtc 
tctttccaac 
ttcggttcag 
gcctttccac 
acccagagac 
gccaaggact 
ggcatgtgac 

ggggtggggg 
cctgcagggc 
aaccctgtgc 



acccccagct' 
aggcgtagga 
tatccttggc 
cgtctcttcc 
ccccacccca 
gctcgggtgg 
tggtggtctg 
ggccttggga 
gggtgcgagg 
gtggccgact 
ggcaccctca 
agcgcgaccc 
ctcttcgccc 
ttcgagcagc 
cggccggcag 
ccacaccgtg 
aagcagcggg 
cgggctcagg 
aaggaacggt 

gggtggggcg 
tccgcagccc 
cctcccagcc 
ctgcaccaaa 
ggcaactaaa 
gtcctccaga 
gaataaggac 
gcccctttca 
atgtcttgtg 
ataggtgagc 
taattccccc 
gaatcatcta 
atgtagtctt 
gtgatagtat 
gtgagcacaa 
tatctctcgt 
gaagccctcg 
cgccaggtag 
tggctctctc 
cccgcagagg 
ggcggcttcg 
ggtctgcgtg 
ttcggccagg 
gtgctcaacc 
cgccgctggt 
ggcgaggagg 
aacctgtact 
tgtggcgcgc 
cctcccggcc 
accgccgtcg 
tgagggcgcg 
gggcctgcgg 
gccctccagc 
tggctttggg 
gaacgggtaa 
cccgaaggaa 
tgggatggca 
tgaagttcac 
cgtaggaaag 
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tgactggaag 
cacccaaata 
cattattgtg 
gggaaggccc 
gagcagctgg 
ctattgactc 
agtgtgaaat 
ccctgaggtg 
tggagaccct 
agctttcagc 
tcccaaacaa 
attcaaatta 
ctcttgtggg 
ccaagtcttt 
aaatccacct 
agtcgcttaa 
tcctactcat 
ggcagcagta 

gggagtccct 

tcctcagaca 
agagaggagg 
cagaggtgag 
ggaagccaca 
ccggggcagg 
gtattccagc 
ccggtctgca 
cccactaagg 
gaggttgaga 
gacagaaacc 
cacccatgac 
ctagcagtat 
tataacctgc 
cccctggcac 
aagaaagggg 
cacaggtctc 
ccaatgtggc 
ggctgttttc 
gcagggtagg 
attgaagggc 
cacgaaactg 
gtgagcctga 
ttatccacac 
gaaggctgga 
aggtaaccat 
gacagggagc 
gaggtgccac 
accccatgtt 
gggacgggca 
tccttgctca 
ttcaggaagc 
cccacaggcc 
agagagccca 
acagacccct 
cacgggcccc 
gactggccaa 
tcctgctatg 
tcccatgggt 
cctccattcg 



tggactccag 
tgactgtgaa 
gtggaggcaa 
caaggtcagc 
agggtggatg 
ggtatgcctt 
gtcaggcact 
acaataaaac 
aatagtggga 
tgtgcatagg 
ctccactgct 
attgtcttgg 
gctaggtgct 
ccctggccca 
ggcctccggt 
aaaccaagta 
ggtggacagc 
atggggccac 
gtggcttggg 
gcctcctttc 
aagtgggaga 
aggcgtgggc 
ccatcacgca 
cggggcctcg 
tcggcgccca 
aggcagggtg 
ggacccctcc 
acccctgagc 
ctcgcagtcg 
gcgggcctgc 
ctcctctttg 
ccggggacag 
tcacgggccc 
cagggatggc 
aaaaacccat 
tccaagagtg 
tcccaaaagg 
ctttgagcag 
aaggtcatga 
acaccctagc 
taggaacagc 
ggtgtgtgta 
ggcttcaaac 
cacaccatgg 
aaggcctgga 
atccccgact 
ctcactccca 
ccacgctgag 
gcctgtgcac 
caacctacag 
cacccagttg 
gaaagcccaa 
tgagggtccc 
aaagtcctca 
aggctatgta 
catggcaccc 
cattcacctg 
ctcataggat 



agggacaggt 
ttgtggaaag 
ttatgagggt 
ttatgaagga 
gggaggccag 
taggacattc 
tccctctaac 
atttatgctc 
ggaatgcaag 
gaactgacca 
taagacctct 
aagccagctt 
ataataccac 
tctgcaaagg 
atccactgga 
acacagacct 
agtggggact 
gacgccacag 
gtggggagtg 
gggctcaacc 
ggaggacgag 
aaggccactc 
gcatgtcggg 
ggcctcagtc 
cacacttgac 
caagtcagtg 
ccttccctca 
cgggcaagga 
agcttgtgca 
accgtcacgc 
aggacgccct 
gggcaattgg 
ctacccacca 
ccttctggag 
tcttcagtga 
aatgaagtag 
aaggcaggga 
atccgatggc 
agaggctggt 
ctctgccagg 
agcctgaaac 
gctccagcag 
cagttcctca 
gcaggtacag 
gtgtgggacc 
tgctgagctg 
cacccatcca 
gagcttgagg 
tgggggcaga 
caggagaggt 
tgccacctat 
ctccctccat 
agccctacag 
gtggctcatt 
gtgtggccag 
agcaaggcac 
tccccatcac 
gacaccaggc 



gtggtggcac 
ggcagtagat 
agcatttctt 
ccccacttgc 
agggagcaat 
tcttaccgct 
tggcatgcaa 
aaggggaagc 
ggttcccggt 
gaagggggtg 
ctggcctaca 
ctggcctaaa 
gcggtgcccc 
agtaggcatt 
agtttatttc 
gaggggtggg 
agggaggggc 
agccagctcc 
ggggacccac 
catttcttcc 
ggaggggcag 
cacccccaca 
gggacaaggc 
aaagccgtgc 
accatccagc 
ccatgctggc 
gggatcagct 
tcccttgttc 
tccctcctcc 
gcacacaggt 
ctttgtgcgc 
tcagcacctc 
ccatgggggg 
cccagagacc 
gctggacatg 
ggtccaattc 
gaggcggtga 
aagaggtaaa 
gacggggaca 
ctgggagtgg 
ccaaggtctg 
gtttttctag 
gcagctccca 
ggaagtatga 
tgccatgctg 
cccatccacc 
ggccctgctg 
tgcaggctgg 
gtgaaggtaa 
gagggaggga 
gatcccatgc 
cactccagct 
cttggccagc 
tcatagacag 
ggtcaagtcc 
cacccactgc 
tgccagtatc 
cacagatagc 



agtcctggtg 
ctctaatgtg 
tcgagacaaa 
accccacccc 
gaggggtggt 
catgggcctc 
cagccccacc 
cacagcctgc 
gctagagaga 
ctgctgtctc 
catgaggtcc 
atgccaccac 
tgcctcctga 
accccaaccc 
tttagggttc 
ggctggggac 
aggagaggtg 
gtcctctccc 
cccaggccct 
ggcaggagac 
ggtggcagca 
cccaccccag 
ggggtttaag 
cagtcgctgt 
agcatgggcg 
ccccggcccc 
ggaggtaggg 
agccttggtt 
aaccaggagc 
gccagtgcca 
cgagtactca 
ccccagcctc 
actgcatgct 
cattccccct 
cctccagcct 
aggcttccaa 
tgaggagtga 
ggcctgaggg 
tgaaactgaa 
ggcatggggc 
ggactggtgg 
agggttaggg 
tcttggttac 
gctcatggac 
ccacagtgca 
ctagttggaa 
gaggagggga 
ggacaggtgc 
tctcatacct 
ggcaccttca 
gccccagcac 
gatcgacccc 
aagatcctca 
gaaaatggaa 
tccctagact 
ctacaaggga 
ccagggtcag 
gtgggatgag 



tggtgctgac 
gaggtgggaa 
acacccgtct 
agccatggaa 
cccagctctg 
agtttcctaa 
tgcctgagag 
tgatatggcg 
gaagggagaa 
ccatcaagca 
ctctctcctc 
ctgtgcatac 
gtgagtctac 
cagagaacaa 
tatcccaacc 
tgcacctccc 
gctgaagcaa 
agaccctggt 
ccctctccct 
tgaggcacac 
caaatgaagg 
agaggggcga 
gctgaggggc 
gctctgagtc 
tgccgtggtg 
acccatgcgg 
acctgccaag 
ccctgaggag 
ctcacgctct 
ccttcgcagg 
cacttgacac 
cccgatcctg 
aagccccccc 
agcaggggtg 
gtgggccacc 
aagaaggtct 
ggggggcagg 
gtgtcaatcc 
gctggagctc 
agggcctgaa 
gtgctaggag 
aggcaggagg 
tgccccacgg 
ttctgttcag 
agctcacaaa 
gtaagggagc 
cgcaccttca 
ctcgcggaca 
gtgcaggatc 
cttcctgctt 
ctgcttcccc 
cagccctttc 
gcatctgtct 
accacaaaat 
tcttggccat 
aagttcaaag 
ggatgctggt 
ggatgggatc 



177900 
177960 
178020 
178080 
178140 
178200 
178260 
178320 
178380 
178440 
178500 
178560 
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178680 
178740 
178800 
178860 
178920 
178980 
179040 
179100 
179160 
179220 
179280 
179340 
179400 
179460 
179520 
179580 
179640 
179700 
179760 
179820 
179880 
179940 
180000 
180060 
180120 
180180 
180240 
180300 
180360 
180420 
180480 
180540 
180600 
180660 
180720 
180780 
180840 
180900 
180960 
181020 
181080 
181140 
181200 
181260 
181320 
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aaattagaga 
accagccact 
tgcccactca 
cattcacagt 
tagtatgaag 
gccagcaggc 
cacagttact 
gttttagcca 
cccccctcat 
gctaactgtg 
atgtgcagaa 
acacccagga 
gttgtaaaat 
gcaagcaaaa 
ttaaaataag 
acattttaaa 
tgggaggccg 
gtgaaacccc 
taatcccagc 
tgcagtgagc 
aaaagaaaaa 
tagtatgact 
gtggatatga 
agcatgagtc 
gaaaacttat 
ggccgacacg 
accccgtctc 
ccagctactc 
tgagctgaga 
aaaaaaaaaa 
tgactgtggg 
tafctaagaca 
ttccatttac 
tggacagaaa 
cagagtcttt 
cagcctcctg 
cgtgcgccac 
ggccaggcta 
tcaatgggcc 
tattgaccaa 
aacagggggt 
ggttaaaaaa 
catacattct 
aagaaactca 
tcaggccact 
gatgacagca 
ttgaaatctc 
gacatgccta 
cggggagggg 
tgtcactctc 
ggctgtggga 
tgttgcctag 
cccccttgtc 
gggctctttt 
cccgacctgc 
actggctaat 
agtcttccag 
cctgcccagt 



tatgacagta 
gaactggggc 
aaggctgtgt 
gcttggtgac 
ggtcacaagc 
aaagggtgcc 
ggggctgggc 
ttcaatggga 
gctctgctgg 
acccfcgtgac 
aaatactctt 
acagggagcc 
ctcgfcggaaa 
accagtgtgg 
ctggatccat 
aagctggaat 
acacggacgg 
gtctctacta 
tacfcctggag 
tgagatggcg 
aaaaagctga 
gaccgtacgc 
aaactggaat 
ttaccataca 
atccagggcc 
gacggatcac 
tacfcaaaaat 
tggaggctga 
tggcgccact 

gctgaaataa 
tacttttaaa 
tgagtggatt 
tctcttgggg 
tggatacaaa 
ctgfctgccca 
ggttcaagca 
cacgctcagc 
aacctgggcc 
agacctcccc 
cctgaaggcc 
gataatacta 
aaagttgcac 
tccattctca 
ggtgaccttc 
ggcaccacag 
gtggctctgc 
actgcctccc 
taagcccggc 
aagaagctgc 
ctgggtgacc 
tccttctgca 
gaagagtggg 
ccctcgtcac 
cagggctaga 
ccctcccaca 
atgacccttg 
actcccacca 
gctctccttc 



gaccagctgt 
tcctggctgc 
ccaatggccc 
aaccaccctc 
cagggfcggtt 
tgctgtactc 
agccccatcc 
ccaggttgat 
tagggggcac 
ccaggggtac 
cgctgcccta 
agctaagtca 
attatfctact 
acacggtgag 
caccatagtg 
aagccaggtg 
atcacgaggt 
aaaatataaa 
gctgaggcag 
ccactgtact 
aataacacac 
tgaagaggat 
ggacattgct 
afcctagcaafc 
gagcacagtg 
gaggtcagga 
ataaaaaatfc 
ggcaggagaa 
gtactccagc 
cacacatcag 
tttcttcata 
ttgcaatgag 
aagctgtcag 
ccfcgggcccc 
ggctggagtg 
attcttctgc 
taacttttgt 
ccttttgagc 
cgacccaagg 
tcagtctcca 
cttcaaagag 
cttgttattg 
caccaactct 
ccaaggtcac 
ccacgacact 
tcatcttgat 
atctgacaga 
tgagaggaac 
tgatggggta 
atgaccaccg 
tcgtggcctg 
agagggctct 
ccccaagact 
tggacactag 
aaaatccatc 
cttggagggg 
agccagggca 
acagtgcaaa 



59/154 

gtcagcccag 
cctgctggca 
atacfccacga 
tgggacaaat 
gctgagcagg 
ccagcttgca 
ctgggggcca 
attgctacat 
atccctcaag 
tacttataga 
tttataacaa 
atgatgacac 
gacacaggta 
gccagctttt 
ttttcagaaa 
tggtggctca 
caggagatcg 
aaattagctg 
gagaafcfcgct 
ccagcctggt 
atcagaatat 
gcgtattgac 
atgacgggat 
ttcactccta 
gctcacgcct 
gatcgagacc 
agctgagagt 
ttgcttgaac 
ctggtgacag 
aatattaacc 
ttttctctgt 
gggagagagc 
ctgtaggaca 
ccaggttctt 
cagtggcaca 
ctcagcctcc 
attttfcagta 
agcgcagcag 
gcactcctgt 
ttttccccaa 
cctaactgag 
ccatagttag 
aggggatgtg 
agagccagaa 
gccttccatt 
ctggatgaca 

tggggcaacc 
ggggataggc 
ggtgtgcatg 
agtcatgcag 
ccgcaaagcg 

gagggctttc 

gcccttgaca 
gatcatgagg 
ccctgagagt 
gtggggctgg 
gcgtgggact 
gctccccatc 



agctggccca 
ctggactggg 
cccaagcaat 
gccattcttg 

ggggctggtg 

tgggcacaca 
actgggactg 
gacaaacttc 
aaggctttct 
atfcttccagg 
tgaaaacaaa 
atatcatgga 
ggagacagtt 
gtttttggaa 
aacaaataaa 
cgcctgtaat 
agaccatcct 
agagtggtgg 
tgaacctggg 
gacagcaaga 
taaccatggt 
tgtgctttcc 
ggagctttct 
gatgactacc 
gtaatcccag 
atcctggcca 
ggtggcacgc 
ctgggagctg 
caagactccg 
atggttattt 
cttccaaatt 
tatttttaag 
aagagtggga 
ttttgtttgt 
atctcagctc 
caagtagctg 
gagacgggtt 
accccccact 
tcaactctgg 
tccagctcca 
ccagagattt 
cttcacacct 
attgtctcca 
tggctggcct 
gctattgtct 
ttctaaatgc 
aaggcacagg 
acccagaagc 
taccttgacc 
cttctcatca 
gatactccgg 
cctgtcccca 
tcatccagct 
cttgaggcct 
gagcgtgggg 
tgccagagcc 
cagcccagtc 
cctggggcct 



cagacctggc 
gatgccfcgcc 
ttctgttgtt 
gaaactccag 
ggggtgccac 
gagtcctgct 
gctgcagaga 
aatccatgtg 
ggctgtacct 
ggaaatagag 
aacagccgta 
ccagtaggca 
cacaacagaa 
acaaagcttt 
taaataaata 
cccagcactt 
ggccaacaga 
cacgcgcctg 
agctggaggfc 
ctccgtctca 
tattttggga 
ttcfcttgctg 
tacaaagctg 
caagtgaagt 
cactttggga 
acagagtgaa 
gcctgtaatc 
gaggttgcag 
tctcaaaaag 
tgggatagta 
ttctcatgtc 
tgttggtttg 
acttcagagt 
ttgtttgaga 
actgcaacct 
ggactacagg 
tcgacccatt 
cagggccatc 
acccctgatt 
ccaagcagaa 
ggcaaagaag 
gtatcacata 
cttgagagaa 
agacctgaac 
gggtgaagca 
tctcatctca 
gaaaggcaca 
caggcagagc 
agaaagctgc 
aagtggacgt 
actctgggct 
ggaaactcct 
cccactggct 
ccccgccggc 
ggactcaggg 
aggaagggac 
ctctggatac 
acccctaagc 



181380 
181440 
181500 
181560 
181620 
181680 
181740 
181800 
181860 
181920 
181980 
182040 
182100 
182160 
182220 
182280 
182340 
182400 
182460 
182520 
182580 
182640 
182700 
182760 
182820 
182880 
182940 
183000 
183060 
183120 
183180 
183240 
183300 
183360 
183420 
183480 
183540 
183600 
183660 
183720 
183780 
183840 
183900 
183960 
184020 
184080 
184140 
184200 
184260 
184320 
184380 
184440 
184500 
184560 
184620 
184680 
184740 
184800 
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tgtgtcagtg 
gacaggaaca 
acactcaggc 
aagaagagag 
ctagcccacc 
aagggcagac 
tgaatgctca 
taggccctcc 
ttgagccggc 
ctggtgagcc 
tatcccttct 
cacatcacca 
acggcaggga 
gcaggggtcc 
ccaatcacat 
tgcgggccct 
gtcctaccac 
ccacgcagct 
accagcggga 
ccatggctca 
cagcagccca 
ctgcccacat 
ccagcatgtg 
cagcaccccc 
ccagggacca 
gtcacacaag 
gaacagggga 
gggccatatg 
accctgccct 
ggggcagtga 

caaaaagaat 
acagagatta 
cctcatttgt 
gaggaagaat 
ggaaagctct 
taagtcattc 
taaaccagag 
agaggacctg 
acccaggctg 
gagcgactct 
ctagctaatt 
cgaactcctg 
tgtgagccac 
tggaggtatg 
gcctgtcaca 
ggattcctgc 
tttgctaaaa 
acgagcccag 
gttttcaaat 
cccagcactt 
aacccagtct 
ccagctagtt 
tgagccaaga 
aaaaaaaaaa 
caaactacag 
gtttttgaga 
taatctccca 
cacgcaccac 



catgaggccc 
ggctctgggg 
cacctgggcg 
aatccaagga 
atcaaggccc 
ggatgagaca 
ggctgtagag 
caccaagagc 
ctcttggata 
ccacagaacc 
ggactctgaa 
gtctcagtgc 
cacaagagca 
tgctagggct 
tcgcacacca 
acctgcagca 
gccaggaagg 
cctgccagac 
gcctgaagga 
gggacaccag 
agtggtgcct 
tcaccaagag 
ctgagctgca 
agagactggc 
gttctggtgc 
caatgcagct 
ccacataaag 
caaaacagat 
gagtctcagg 
gaggcacccc 
aatgagtgca 
ttagaaaaga 
caccactgac 
tcagcctgtc 
tctttcggga 
tgcaacccct 
gggaaggttg 
gtatggtttt 
ggatgcaatg 
cccgcctcag 
tttgtatttt 
acctcaagca 
catgcctggc 
tgacttctcc 
ctgatgcctc 
caaaaaggtt 
acacagagga 
aaggcaggat 
gtcaagaaga 
tcacggatca 
ctaccaaaag 
gggaggttga 
tcgcgccacg 
aagaaagaag 
ctgattacac 
cagggtctca 
ggctcaagca 
catgcccggc 



ctgcctgccc 
gatctgcctg 
aggctgcagc 
agcttccaga 
tcagctgtct 
ggcagggtag 
gccgagagca 
cctgcccagg 
ctcggaggcc 
ctcaaatctc 
agaaacccaa 
tcaccataag 
cacacctcca 
tgcgtgagcc 
gcacacacgc 
atggccctcc 
cacttaccct 
accgccactc 
tagggagtgg 
aggagctccc 

gtactctcta 
ccctcacccc 
gcttctaggt 
aggaggggca 
tgctgacccc 
gggtgcggtc 
taagcgtgcc 
gcctgtggag 
cctgcctccg 
aggaagcctc 
actgatggaa 
agagagaaag 
ggtagcagtg 
tcttggcctt 
agaattcttg 
aaaggaacaa 
acaggaagca 
tgtttttttt 
gcacgatctc 
catcccgagt 
tagtagagac 
atctgctcac 

cgaggacctg 

cagtcacaca 
ctgaagtaag 
taacctggaa 

ga-gggg aa ca 

attctacagg 
aaaagaaagc 
caaggtaagg 
tacaaagatt 
ggcaggagaa 
gcacttcagc 
aaaaataaaa 
agctgatggt 
ctctgtccag 
atcctcccac 
taatttttgt 



atcctcatga 

gggtggcaga 
tgccactaaa 
cccacctcaa 
ctgagctggg 
gcgggagccg 
cggccctagt 
gtcctcttgg 
caggggacca 
accagatcac 
actcccttcc 
cccctcggct 
cagaccagcc 
ccagaacctg 
actcacacat 
taccagggcc 
tgttggctgc 
acgctcagca 

gg a gggcagg 

tttaggaagg 
gcacgggggc 
ttcctgatac 
gccaaagcaa 
gcctggaaag 
aaaggctggg 
agaggcaaca 
cttagagctt 
ggggtgtgcc 
cctcagctcc 
caatgggccg 
acagatggaa 
caaaacaaac 
ccctctttac 
tcttgtaacc 
ctaataaata 
atgctggagg 
gatactcaca 
ttttttctga 
ggctaactgc 
agctgggact 

agggtttcac 

ttcggcctcc 
gtgtttaaga 
ctggtctatc 
gcagcccaaa 
tctaaaccag 
aatttaatga 
atgactgacc 
aggctgggaa 
agttcaagac 
agccgggtgt 
tcacttgaat 
ctgggtgaca 
cgagggagat 
aaggttgctg 
cccaggctgg 
cccagcctgc 
agagacagag 



cccaaacctg 
aacagggggg 
cctcactggc 
gacccccttc 
tcgggttctc 
agctgaaggc 
cctctctgac 
cgggaggacc 
gcctggcaca 
cctggcctaa 
acagcctgct 
cccccaaccc 
taaggagtca 
gcacaaacac 
gcacacacac 
caccggaaga 
agccatggac 
gcctcccatg 
ggtcagggcc 
actcaaaccc 
ttctgcccac 
cccagtgact 
cagcatagga 
gaggacttta 
cctggagcta 
gggtgctcga 
tccctcctgg 
cccacctgaa 
ccaacggcag 
agctgggacc 
cacataaaag 
aaacccaagc 
tctgaaagct 
aaattgccct 
acaaagaaat 
caacaagaac 
ggcgcccaag 
gacggagttt 
aacatctgcc 
acaggcgccc 
catgttggcc 
caaagtgttg 
ggaatggcac 
tctctactct 
gtcacagcat 
cctctaactt 
ctccatgaga 
tgttttttgt 
cggtggctca 
cagcctggcc 

ggtggcgggc 

ccagaaggcg 
cagcgagact 
tattctagat 
cctgttgtgt 
agtccagctc 
caagtagctg 
ttgcccaggc 



aaagggcagg 
atcaaaaaac 
ccctggcagc 
cttcctctgt 
caaggccaga 
ctggactgga 
ccctggcccc 
tgatccagct 
tccacaatcc 
aaccctccct 
gtggccctcc 
ccacagtgga 
gcagggctct 
tgacatagtc 
acacaggtgg 
gcttgtgcaa 
gccctccctg 
ctccagggac 
acccatggac 
ctaccgctct 
cctctaccga 
cagggtcctg 
ctcctatccc 
ttggtatttt 
ccttattcag 
tggtgggcaa 
tgatcggtca 
ccccattcag 
cctcagcaca 
atctgagcat 
cagtgcattc 
cctcaaaagc 
gaggattaaa 
ggtggttgat 
gaccaaatgc 
cagtggatgc 
cgtcactgac 
cactcttgtc 
tcctgggttc 
gccaccacgc 
aggctggtct 
ggattacagg 
aagtttgcaa 
ctgcaggaca 
cactgacaag 
caacttctca 
aagcaatgag 
ttttgttttt 
cgcctgtaat 
aacatagtga 
agctgtagtc 
gaggttgcag 
ctgtctcaag 
aggaaagcag 
ttttgttttt 
actgccaccg 
agaccacagc 
tggtctcaaa 
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186840 
186900 
186960 
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ctccggagct 
agccgccagg 
ttcattcact 
gatagagacc 
agtttgccaa 
caggacaact 
gtgtttcgaa 
cctgtaatcc 
ggaccagcct 
gcatggtggt 
gaacctggga 
gacagagcga 
cgcctgtaat 
caagaccagc 
aggcgtggtg 
tgaacccagg 
caaaaggagc 
aaatgtaaat 
tatgtaaaga 
tgacactggc 
ggctgggcct 
agatgtggcc 
ccctgcctca 
tggaccaggc 
gaagtggagc 
cctggaaagc 
ccccccgact 
acaagagtgt 
gaggcagtga 
ctcaggttaa 
ctctgaactg 
tccccccgcc 
agccacacct 
aaatcccagg 
tgatatccct 
cccggcactc 
gaatgaccac 
gatatggtca 
ctcttttgcc 
cctcccactg 
ttgagaggga 
actcagtaaa 
aggaagcaag 
tggagaacct 
atcagaaaca 
agctaggcca 
aaggcctcac 
aggatctgcc 
atgtgtaaca 
gaagctctgt 
ctaaagcaac 
agccaggagt 
aggggtgagg 
tccctaggga 
tcacaaggaa 
atggtgcctc 
agcttccagg 
gcatgctccc 



taagcgatcc 
cctggcaaat 
tacatgttgt 
atatggcctg 
cccttgttct 
gacctgtttt 
ctacctatca 
cagcactttg 
ggccaacatg 
gtgcacctgt 
ggcggaggtt 
gactctgtct 
cacagcactg 
ctggccaaca 
gcgcatgcct 
aggcggaggt 
aaaactccgt 
ggcctggata 
attatgtcct 
tcatcagtcc 
gtggagggca 
tcacctcgct 
tgggcagggt 
cagcaacccc 
tacacccatt 
tcagcttgtc 
acagctgccc 
gaggccctcc 
cagcccagtg 
actgcaccag 
gtccatgcag 
ctgcctttgc 
tcgacagggt 
acaggactac 
cttcctcacc 
agcgcagcac 
atcacatttg 
gggccagtga 
tcacccctgc 
ggactcctca 
catgccccat 
gggcaaacac 
ggacttcagg 
gctggtcgct 
ggcaaagagg 
ccacacaacc 
catggctcaa 
cttgctagct 
caggaataag 
gacaatgcac 
agcagtttca 
gacccttaga 
gggcctccct 
gggtcagaga 
gatgtgcaga 
ccccccagca 
ccccagccca 
aacgtctgag 



acccacctca 
aaatttttgc 
ctgtttttgc 
caaagtctaa 
agatcaacac 
tttcaccaca 
aagaagacat 
ggaagccaag 
gcgaaacccc 
aatcccagct 
gcagtgagcc 
caaaaaaaaa 
tgggaggctg 
tggagaaacc 
gtaatccagc 
tgcggtgagc 
ctcaaaaaaa 
tgaggtaaca 
tattttgtca 
agggacagaa 
tgatctcaag 
ccccacttct 
gttcacagtt 
ctcttggagg 
cacctctggc 
cccagggaac 
ccaaagctgt 
tatcctgcag 
tcctcaaggt 
tggctggtga 
ggcaccctcc 
ccaggtcatc 
ctcagcccca 
agctcttttt 
ttacaagggt 
cagacaggga 
cttggagggc 
ctgcactggg 
cctggggggc 
ggccttgctg 
gcccactgag 
tgagggcctg 
aacccaaagg 
gtcctcgggg 
gcactgggcc 
ccaagcctgg 
tccacatggc 
gcatgggtga 
cagtggcctc 
cccccatctc 
gcagcaccgt 
gtgggcccca 
cctcctgcct 
acagatgctg 
atgaacatcc 
aaaaaaatta 
gggtgcagga 
acctcatttc 



gcctcccaaa 
aaataaagtt 
actaaaacaa 
accatttatt 
acacttaaca 
tcagctgcaa 
tttggggggt 
gcgggtagat 
atctctacta 
actcaggagg 
aagatcccgc 
aaaaaaaaaa 
aggcgggcgg 
ctgtctctac 
tactcgggag 
cgagattgcg 
aaaaaaaaaa 
ctaagaaatg 
gggaaacata 
gaggaactac 
gtccctctgg 
caatgtgggc 
tgggtaccac 
tgagcttcca 
cagggccact 
ttggtcttgg 
gaactcagag 
aggctgctcc 
ccccccatct 
gcaaaggctg 
ctgtcttcct 
ccttcagcca 
acatttcttt 
ggaatcccct 
agtgactgtg 
ggagctgtgt 
ctggccaagg 
ggctctgagg 
acttccctct 
cagtcagctg 
ggtaggaccc 
taaccctctg 
aaactgggaa 
agctagactc 
tcccctgctg 
ccctgcctcc 
ccccagaggc 
gttctttcac 
ttccttccag 
tgcaaaagca 
cagagaggca 
gggacgtcag 
cccccttctc 
ggagtttcca 
tgggaaactg 
tacccttccc 
agaggagcta 
tcattccttc 



gtgctgggat 
ttactaatag 
agttacttcc 
atttggccct 
gaccaaaaaa 
gaagaaaaat 
gccgggcgct 
cacctgaggt 
aaaatacaaa 
ctgaggcagg 
cactgcactc 
aggccgggtg 
atcacctgag 
taaaaaatac 
gctgaggcag 
ccactgcact 
agacatttgg 
agttgatcat 
ctgaacaact 
agggtggcct 
tgccaagctg 
aaagtccacc 
caggagcccc 
ggccccagcc 
gtggggtcag 
agagcagcct 
tatatgacaa 
agccacacct 
cccaggatgc 
caggcccttt 
CGcaacctac 
ggaacaatcc 
tttaaggatg 
ttctcatgtc 
tttttgtcaa 
ttggtttatg 
cccgtacttc 
aagcagaact 
ccctggcccc 
gttacctgtc 
acatctcatg 
gatagtgaca 
aaccaaacct 
ttgtgcacac 
cagctgtgtc 
agcctcctcc 
agcagtcctg 
ttctctgtcc 
ggtttgctgc 
aacccccaaa 
cttcaggcca 
ccttcttgga 
ccactcccaa 
gtccccctaa 
ggaaatgact 
catgagatgg 
cacccattca 
ttcagtcccc 



tacagacgtg 
aacactacac 
aagtagttat 
tcagagcaaa 
ggataatctt 
aaaggagagg 
gcggctcaca 
caggagtttg 
aatcagctag 
agaatcgctt 
cagcctgggt 
aggtggctca 
gtcgggagtt 
aaaattagcc 
gagaatcgct 
ctagcctggg 
gggatctgga 
ggaattgtat 
tagagatgag 
catagtttag 
ctctgggagc 
caggcccaag 
ctttggccaa 
cagggtgcag 
ttgcctcagc 
gccgtaaccg 
atggccaagc 
ctgcagccag 
ttcctgaagg 
tcacaaaaca 
tgcctgaccg 
cgtccctccc 
cacaccctgc 
tgggatcatt 
accctgtgtc 
tttgttgaat 
agccctaaat 
ccccacccca 
caccaagtgg 
catgcctgct 
cagtgccagt 
acatagaggc 
cctttctcaa 
acaattcctc 
cacagagaac 
tgtgccaacc 
gttcaaattc 
ctgtttccta 
aactgtgcct 
ggcctggttt 
atcctggagg 
aaacagctca 
agcagccagg 
ccagaggggg 
agggaggaac 
agtgtcagca 
cctctggtca 
attctcattg 
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tggttcgagg 
tcctctccta 
ctgtgtagcc 
tggggccaca 
gaacaaagcc 
agctttaatg 
tggctgttct 
aagggaccct 
agcattctcc 
cctaggccct 
ccagggggtc 
gctgcctcgt 
acaaaagacc 
gcagaaaaat 
gacttccctt 
tccctcctcc 
tccaggtgta 
cacaccccat 
ccaatccatc 
cccaccttca 
ctgccctcct 
catgcagcga 
acgcccccgc 
cagctgccac 
gctggggagt 
tcacggccgc 
gacctcggcg 
gggtgcaccc 
gctcaacgcc 
ccccgcctcc 
agcgggctcc 
gaggcagcgc 
cggcccaact 
ataggccccg 
gcttaggacg 
ggctgggtgg 
tcgatacggg 
gaccccaggg 
cccgtgcggg 
aagaaggcac 
tctgctcccg 
cgcggggtgg 
ctgcgggggt 
cgcccgtggg 
ttgcttccaa 
gggccggggt 
tctgctgatc 
tcgcgt erect 
tcagcagcac 
agcttttcac 
actaccttta 
cgtaggtgta 
cctcctccgc 
atctgacaac 
atttaccata 
agacatttgt 
ctctcttctc 
ttatttattt 



tctttctctc 
ccccagcctg 
tcccattaaa 
ctgacccgct 
tggaaaggca 
tgatttaatt 
ctctccttct 
caccacccac 
ccactttgcc 
gccctgggcc 
agecaaggtg 
agctcaggat 
agaatgggag 
gccctacctc 
tctcagtccc 
ttcataccac 
ggacccttta 
fccttgtcatc 
ctgagctgcg 
ctcacctcgg 
tcctgagggt 
ggcageggea 
cctcactacc 
cagttaaegg 
cttccaggcc 
tggggtccca 
aacgggaagg 

eggggceggg 

gagcacgagg 
gcccattggt 
aaagaaegge 
cctgctagtc 
aggecagggg 
cccccgctcc 
cacctccctt 
agctcccgaa 
ccgtgttggt 
tttggacctc 
cccgagacgc 
gcgggctggg 
gtctccatgg 
gatgtgggcg 
tettgeeggt 
gaeggattec 
eggaactaga 
cgctgcctta 
tccagtctgc 
ctcctgctcc 
tttgggtgac 
attctgatct 
gaccttgtct 
ttttgtttag 
ctctttctgg 

gtgggctcac 

gtccccgcta 
gatcccctcc 
atagecaaaa 
atttttgaga 



tgagccagga 
gggctcagtc 
gctcacgcct 
gactctcagg 
aggctggagg 
tccttctccc 
aacagtgagg 
accttcctgc 
ctcagctagt 
tgtgatccag 
agggtgggag 
ggatagaatg 
ggggatgggg 
cactttccag 
tgcttactaa 
egggagggag 
tcctctccgg 
tcagggggag 
aggeggegge 
ctgaegcagg 
gctgaggctc 
atgaacaggg 
gcactcatgc 
tcactgcgcc 
cggcccccaa 
ggcactgact 
aacegggagg 
gagggcaggg 
aagggagece 
tgatctggga 
caactggega 
cgcgcctgcc 
ccagaaccga 
ttcttccgcc 
gggcccttcg 
ggtggccttt 
ctegttcagg 
ttccccttcc 
gtcccgggtg 
ggctgggaga 
aggeggtege 
ectgeaatga 
gctctccgcc 
ccaaagtgcc 
gagaeggect 
aaggagcect 
acactgttgg 
aaagccctct 
tcatgtgccc 
cagcccatcc 
tgaaagtcta 
ctcacatggg 
aatcaggaat 
tttccagcct 
atccctgttt 
ccctcattat 
ttcaagccct 
cagagtctca 



actccgcaac 
ctggctcaag 
gtgaaaagaa 
actggacaca 
tcagtagttg 
tccagtgggc 
tgctgggggt 
ctccccaaca 
gggtgcttag 
aggtcccaag 
cacacaggat 
tggggggata 
tgctgccccg 
tgecaagatt 
tggaaacacg 
gtgcagccca 
cagccatccc 

gggggaaatg 

tctgtccctc 
agtctccgga 
ggaggctcag 
tcgtggggag 
ccccggacaa 
ccgggaacct 
cagcggagcc 
gctcaggaca 
cagggagcag 
acgaccactg 
cgcgccgcgg 
gggtggggcg 
gggccgccta 
gggegagetc 
ccactcgaag 
tgggggatag 
etctegggag 
ctccctgggc 
agctgaggaa 
accccatccc 
cctggccggg 
gcctgtgacg 
catggcagcg 
gecgaggage 
cgccggttcg 
ctcagtaagc 
gggeggcega 
caagtctgcc 
caaattaatc 
gggactgect 
tcgcgtttga 
actcctgccc 
aagttggcag 
gctttttaaa 
ctggtttcgt 
ggcattacaa 
etcaeggtta 
cccaccttcc 
cttcaaccaa 
ctctgtcgcc 



cttccaccct 
cactcagtcc 
cacccattga 
geagaggeca 
tggcagcttc 
caaaggtgea 
gggggtgggg 
agtgctgccc 
cctccagata 
aagcagaggc 
tatctcccag 
tccagctaca 
actttcttca 
caagaagaaa 
ggtccagaac 
agcccccgag 
tgtgggtgtg 
taateggaca 
ggagataatc 
gcccgcactc 
agatgetact 
gagggggege 
tcgcttcgcg 
gacatcactc 
cccccacccc 
gggcegggae 
aggggtggcc 
cagcggcggc 
ccgccctccc 
agggacgetc 
cgtcacgtgc 
tegegaggaa 
agggagaagg 
cgcctctagc 
ggctgtcggg 
ttccacgccg 
ccctccatca 
ctgtcttgaa 
cctggagaag 
cgcccccggg 
agatgegect 
gagaggegtg 
cgaacacccc 
cgtccggagc 
aggccagcct 
accctgtggc 
tttctgagct 
ccagtagcgc 
gagacagege 
actctaccca 
acggtgggcc 
agaagtcctt 
ggcttttgaa 
gcccctttaa 
gttaccttct 
aacctggaag 
gataagtttt 
caggcttgag 



ccctacctct 
agcagaagca 
ggecttgaga 
cacatactca 
acatcaactc 
aagataagta 
gaatatggag 
tcctctgccc 
gcatgcccca 
caggctggat 
ggacagggct 
ttttccctcc 
actccccgga 
ggcaagcgga 
ctaaatccag 
gccccaaggg 
gcacccccgc 
tcccccccat 
ctgtgcactc 
ccagacatca 
ggtccaaggt 
tgaccccatt 
gaaaacaccc 
tagttcaggt 
agccctgcca 
eggggagggg 
tcaccttgcg 
ggctgcagga 
gtcggcacgc 
eggaccaatg 
cagggtcgcc 
gaegggcagg 
agggectegg 
cttgaacctt 
cgcgtctcgg 
gcttcggcca 
ctcctgtttc 
agaagcaacc 
catcagaaca 
gaccgcagcc 
cgctcagcac 
gccctccggt 
acctatactt 
acgcagcgcc 
cccttcagca 
ccataacctg 
cttgttttca 
ttcacaaact 
ttacattgag 
ctcccaccac 
caaagtggcc 
ccccctcctg 
atatcagaag 
gagcacataa 
gggtctgtgc 
gccctccccg 
gectatttat 
ttcagtggtg 
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ccatctcggc 
ctcagcctcc 
gtatttttag 
tcaagtgatc 
gcctggccca 
ggtgtgggct 
attgtcaact 
aggctctgtg 
actgtggtct 
ccactgggag 
gaaggccaaa 
gaagtcaggg 
acgagtagtg 
tctgcaaggg 
aatcaggaac 
gccgggcgtg 
catgaggtca 
aatacaaaaa 
tgaggcagga 
ccactgcact 
aaatccactc 
gctacaactc 
actatgttgc 
cctaatgtgc 
taattaaaaa 
atgataaatg 
tcacccaggc 
tcaagcactc 
acccagctaa 
cggctaattt 
actcccaacc 
gagccaccgt 
ggccaggctg 
gctgggatta 
aactggcaaa 
agagtttctg 
tacaccatgc 
agccatgccc 
aaaaaagaga 
ccccatctct 
cagctaccca 
gaggcaagat 
aagaaaaaag 
ctaggaaccc 
gagtgtatga 
aatgtccata 
gctttaaaaa 
acaggcgtga 
catgttatac 
caaaatgcta 
attagctagg 
atattataag 
tgggttggca 
gcctgagagc 
caaggaagaa 
tcaaagacca 
tatttaggtc 
ctcatctgct 



tgactacagc 
caagtagctg 
tggagacagg 
tgcctgcctc 
agcctgcctt 
ctctcaaaca 
cctttcagca 
gagtggaagg 
cctgtccttg 
gagctgaaat 
gatggttctg 
caggagttac 
cccaggctct 
aaaccacagg 
aactccaggt 
gtggctcacg 
ggagatcaag 
attagccggg 
gaatggcgtg 
tcagctttgg 
accatgagaa 
caaacacacg 
ccaggctggt 
tgggaccaga 
ttaaacaact 
aaatgagtaa 
tggaatgcag 
ctcccacctc 
tttttgtatt 
tgtattttta 
tcaggtaaac 
gcctggccaa 
gtctcgaact 
caagcttgag 
ttgttttcca 
ttgcttcaca 
taaccaaaca 
ttgtagagga 
aaacacacac 
actaaaaata 
ggaggttgag 
agtgtcattg 
agagaggaaa 
cagttctgga 
ggttatttat 
aatagaagac 
gaaaaaaaaa 
gccaccatgc 
aagctggtct 
ggatfcacagg 
attctccaga 
gaattgactc 
agttggagac 
caggaaagtc 
tcaatgtttc 
ccaggcaata 
tcaactgact 
gatttctttc 



ctccaccccc 
ggattctagg 
gtttcaccat 
agcctcccaa 
tttcaaaagt 
caaactggct 
aacatttact 
cactcacagt 
ccaaatgttc 
acagagctgc 
gaaagcctcc 
ctggctgccc 
ggggcttcct 
cccctaccag 
taggagaaca 
cctgtaatcc 
accatcctgg 
catggtggct 
aaccccggga 
gcgactgagc 
cctggtgact 
gttttttgtt 
ctcaaactct 
ccagaggcac 
ataacctcat 
ttccactttt 
tggtgcgatc 
agcctcccca 
tttttttttt 
gtagagacgg 
tgcctgcctc 
tttttttgta 
cctgacctca 
ccaccatgcc 
aaatgactgt 
tcctcatcaa 
aagcagggca 
caatttcaca 
acacacacac 
caaaaaatta 
gcaggagaat 
cactccagcc 
aaaaacacaa 
atttatcatt 
tgttgcattc 
ttcttaaata 
aaagaggaag 
cgggataatt 
cgaactcctg 
catgagctac 
gaaacagaac 
acacaattat 
ccaggagagc 
aatgatgtat 
agtttgatcc 
aaaattctct 
ggatgtggcc 
ttttttcctt 



tcccccgccg 
tgcgcgccac 
gttggtcagg 
agtgctggga 
ctttctagcc 
gctaactgca 
gaatgtatct 
ttagtgtagg 
ctcacactcc 
ccatgcctag 
aggagaaaat 
catcactgcc 
gtgcggaggg 
gccccctgga 
gatggagcac 
cagcactttg 
ctaacacggt 
gatgcctgta 
ggcggagctt 
cagactccat 
tttaaaaaaa 
ttttttttta 
tggcctcaag 
aagccactgt 
tggaaaactg 
tttttttttt 
tcagctctct 
gtagccagga 
ttttgagatt 
gtttctccat 
agcctcccaa 
tttttaatag 
ggtgaccgac 
cagccatggc 
acttacttac 
cataaaaatt 
acggactacc 
acagctatca 
acacacacac 
gctgggcgtg 
cgcttgaacc 
tgggccacag 
caaagattac 
tagatatact 
ttgtgatagc 
aatttggtac 
ttttcaatgt 
tttgtatttt 
gcctcaagcg 
cgtgcccagc 
taggaaagag 
ggagtctgac 
caatggttta 
ctccagtcca 
caaaggcgag 
cttatttggg 
cactcacatt 
ttgtttttga 



ggttcaagcc 
cacacccagg 
tgggtctcaa 
ttacaggcat 
aacttctcaa 
cagcccgggg 
gatgagtaag 
agggacacca 
caggcatcgg 
ggatactaca 
tagactcttt 
caatgtgtct 
ccttaccaaa 
gacagaggca 
gtataaaaca 
ggaggctgag 
gaaaccccgt 
atcccaacta 
gcagtgagcc 
ctccaaaaaa 
aacaaaaaaa 
tttttcagag 
cgatcccccg 
gcccagccct 
cttggcagta 
tgagacagag 
acaacctctg 
ctacaggcac 
acaggcatgt 
gttggtcacg 
agtgctggga 
agacggggtt 
ccacctcaga 
atgtgtttaa 
actcccatta 
tgcaattttg 
agtcttcagc 
agattacaaa 
acacagccaa 
gtggcaggtg 
cgggaggcag 
cgcgagactc 
agatgcatat 
tccccatgat 
aaaagattgg 
agacaatata 
cctcccaagt 
tagtagagat 
atccacccac 
ctgctaagga 
ggcagatagg 
cagtcccaag 
gttccagtct 
aagggcagca 
aaaaaagctg 
gagggccagc 
agagagggaa 
aatggattct 



attctcctgc 
gctaattttt 
actcctagcc 
gagccaccgc 
aaccatctct 
ttatttccga 
ttattgtgct 
cacacaaaga 
gcagtctggc 
tgctctcagg 
gtgtcaggag 
gtgttaccac 
gatagatggc 
ggacaagaag 
tccactcacg 
atgggcggat 
ctctactaaa 
ctccggaagc 
cgagattgcg 
aaaaaaaaaa 
cacaacactt 
acagggtctc 
accttgggct 
ctattctttt 
cctaataaaa 
tcttactctg 
cctcctgggc 
atgccaccac 
gccaccacgc 
ctagtctcga 
ttacaggcat 
tcaccatgtt 
ctcccaaagt 
ttttaaaaga 
gcagagtata 
taaaaaacgc 
ctctcctctg 
tgcacacaca 
catggtgaaa 
cctgtaatgc 
aggatgcagt 
tgactcaaaa 
attctttgat 
actaaatcat 
aaataactca 
atgttatatg 
agccaggatt 
ggggttttgc 
ctcagcctcc 
aagattttgt 
aaagactgac 
agctgcaggt 
gagtcccaat 
ggtttgagac 
atgttccagt 
ctttttcttc 
atctgtttca 
cactctgttg 
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cccaggctgg 
agcgattctc 
cggctaattt 
gaatgcctga 
gtgagccacc 
ggtttacccc 
ggtgttcggg 
tggtggcatg 
cccaggaggt 
aaaacaagac 
acataaagtc 
gtgctgggaa 
ttatctattc 
tagggttgtg 
atttagtaca 
tagcagattt 
aatcccagca 
agcctgggta 
aaaagcccag 
gtggatcacc 
ctactgaaga 
aggaggctga 
tcatgccgct 
agctggatat 
ttgcttgaac 
ctgggcgaca 
gctcacggct 
gagtttgaga 
tagccgggtg 
atcacttgaa 
cctgggtgac 
ttaaaaatta 
acatcctatc 
attcaaggaa 
gttaggagct 
gttatctctg 
tttgtaataa 
attatggaca 
aatatttaaa 
ctgagcccca 
cacttgcccc 
atctctccat 
aacctcacct 
ggaccaggag 
gacaggtttg 
ggccaggtcc 
aagtctgtct 
taatagactc 
gatggctatg 
gctttcagtt 
gcctggatcc 
tgcagccttg 
cccacagaaa 
ataatttgct 
caaagagtgt 
aggccaaggc 
gaaacctgtc 
tctcacatac 



agtgccgtgg 
ctgcctcagc 
ttgtattttt 
cctcaggtga 
gtgcccggcc 
tgttttccca 
tcaacatggt 
agcctgtagt 
ggagggtgca 
tcttctctca 
attccggaag 
taagaagact 
aaaacattaa 
aacactggag 
tagtaggtac 
agtgtcggtt 
ttttgggagg 
acaaagcgaa 
gagcagtggc 
tgaggttagc 
tacaaaaatt 
ggcaggagaa 
gcactccagc 
ggtggtatgc 
ttgggaggct 
gagtaaaacc 
gtaatcccag 
ccagcctgac 
tggtggcgtg 
ccagggaagt 
agagcaatac 
aattaaaaat 
acatttgctg 
agggtacaca 
gtctactaca 
agtggaagca 
ttccattctt 
aatggggaga 
atattcaata 
tccatgcact 
aaactaaccg 
agtctgcaat 
gtaggctgct 
acagggcagg 
atggctggta 
tgatggttct 
ctgagaagtg 
tatcaactct 
tgtcaaggaa 
tgacagccca 
ttcctcagtt 
taggaaacat 
atgtgagata 
acacagtaag 
ttctgtgggc 
aggtggatca 
tctaccaaaa 
ttgagaggct 



catgatcttg 
ttcctgagta 
agtagagacg 
ttagcccgcc 
tcatctgctg 
gcactttggg 
gaaacctgcc 
cccagctact 
gttaactgaa 
aaaataagag 
gatacataaa 
tttcactttt 
gtaacaaagt 
ggaacaaatc 
tcaataaata 
agagagagga 
ctgaggcaag 
atcctgtctc 
tcacacctgt 
agttcgagac 
agttgggcgt 
tagcttgaac 
ctggatgaca 
acctgtcctc 
ggggttgcag 

ctgtctctaa 
cactttagga 
caatatggtg 
tgcctgtaat 
ggagagtgca 
tctgtctcaa 
aagggagtta 
tattctactt 

tgggggtgac 
aggaggaaaa 
agggggagag 
ggggcaggtg 
gagtgtcatg 
ataaatgaaa 
ccacctggtc 
ttagcccttc 
aggcccccgg 
atatgtaaag 
ctaacagttc 
tcacgacctc 
tcctctttga 
cacctctgcc 
ctcctttcct 
atgagagtgg 
ctaggaagta 
gtgccttcag 
tgaagtggag 
taataataag 
agataatgta 
caggcatggt 
cttgaggtca 
atataaaaaa 
caggcaggag 



gttccctgca 
gctggggtta 
gggtttcgcc 
tcggcctccc 
atttaaatgt 
aggccgaggc 
tctactaaat 
caggaggccg 
attgtgccac 
taagacaaat 
aactaaagat 
tatgtgtttt 
ttaaataccc 
atgttttata 
gaggaaagaa 
gaggggccag 
aggctcactt 
tccaaaaaaa 
aatcccagca 
caactaggcc 
agtggcaggt 
ctgggaggca 
gagtgagaca 
ccacctattc 
tgagccaaga 
aaaagaaggg 
ggccgaggct 
gaaccccgtc 
cccagctact 
gtgagccgag 
aacaaaaaaa 
taatcttatc 
atcagaagca 
gtccaggagg 
agcaatgaaa 
atgcaagaag 
gtggatttat 
aaccaaggat 
taaaaattgc 
ttagtttaac 
acacacactg 
agttctattc 
atagtaggaa 
tgccttagca 
acctattggc 
ggcttctgct 
acagcaccag 
gtctttgatg 
ctattggcca 
aattcagtca 
atgagactgc 
gacccagcca 
tatttgttgt 
tacaggttct 
ggctcacgcc 
ggagttcgag 
ttagccagat 
aatcgcttga 



acctccacct 
caggcaatca 
atgttgacca 
aaagtgctgg 
taaccttagc 
gggcggctca 
acataaaaat 
aggcacgaga 
tacactccag 
gtgcatgttt 
agaggttccc 
ttttaaatca 
ttgcatgcct 
ctcacttttg 
cgaatgaatg 
gcacagtggc 
gaacctagga 
aaaaaaaaaa 
ctttgggagg 
aacatggtga 
gcctgtaatt 
gaggttgcag 
tctcataaaa 
tagaggctga 
ttgtgccact 
agttagggct 
ggtggatcac 
tctactaaaa 
tgggaggctg 
atcataccac 
aagaaagaaa 
taatgtaatc 
agtcactaag 
cagggattac 
ggatgaaatc 
cacatggccg 
aggtgttcat 
tatatttagt 
caactactcc 
actggctcca 
acccttgcta 
cctgaaactt 
tggctgatgg 
gtagcactgt 
ctcctacatg 
cctggtatcc 
gcctgtaact 
aagtaagcta 
acaatgagta 
gcaaccaagt 
agttctattg 
agcctggcct 
tctaagctgc 
tttcatttct 
tgtaatccca 
gccagcctag 
gtggtggcat 
actcaggagg 



cccgggttca 
ccaccacgcc 
ggctggtctt 
gattacaggc 
cgggtgcggt 
ctcgagctca 
tagctgggca 
atcacttgaa 
ccagggtgac 
gtttatagtc 
tattagagat 
tatatatgtt 
tcctctcctg 
tattcctggt 
aaccagaaga 
tcacacatgt 
gttggagacc 
aaaaaaaaaa 
ccgaggtggg 
aacaccatct 
ccagcaactc 
tgagccgaga 
aaaaaaaatt 
ggtgtaaaga 
gcactccagc 
gggtgcggtg 
ctgaggtcag 
atacaaaaag 
agacagagga 
tgcactccag 
gaaaaatttt 
acaaaagttg 
tccagcctac 
tgggagccac 
taggatggtg 
gtgaaagtta 
aagtgagttg 
ccaattcagt 
tgaccttgtt 
cctgaccctc 
tagcaccacg 
ttatccatgt 
agtggcccga 
gggcccagaa 
gcaaaggctg 
agatatagct 
cccttgttgg 
ccatactgga 
aaacactgaa 
aagcctggaa 
gccctttgct 
ggactcttga 
tacattggtg 
tttgctatta 
gcacttttgg 
ccaacatggt 
gcacctgtaa 
tggaggttgc 
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agtgagccga 
aaaaaaacaa 
gaacacagaa 
tggaattgtt 
taggctggag 
caattctcct 
gctaattgtt 
ctcctggcct 
agccaccatg 
aaaccattaa 
gcctggcctc 
ggccgagtta 
aacaatttga 
gggtttgggg 
tgacccctgg 
aagatgagat 
gaggacaagt 
aaatagaaca 
gcctcagtac 
ctccctactc 
cgccgaggtg 
gacagattcc 
tcggagctgg 
aggcccagcg 
aggctaggtc 
atctccggag 
atctcctccc 
gagagcagcg 
gtcggggctc 
cttacgctcc 
gagcaggtgc 
tcaagcctgg 
agccttcata 
tcagcgatct 
cgaggcgccg 
cgggccctac 
ccctttcctg 
agagatcacc 
cccggggggt 
gtggaggaat 
cacaagtgac 
gccaccttac 
cccgggggtc 
cgggccccca 
tggcctttct 
ggagcggggt 
caggcgctgc 
cttgcgcatc 
tagacattgg 
aagggggcat 
atctgagctt 
tcttgatttt 
acagtctcag 
tacctgttca 
ttgtgcatct 
ggagagggga 
gtgggggtgg 
ggaaagggga 



gatcgtgcaa 
aaccaaacaa 
cacaagagcc 
agatctcagg 
tgcagtggtg 
gcctcagctt 
cgtattttta 
caagtgaccc 
cccagctgga 
agttagggtc 
aatccatgga 
ggggtcagtc 
agtctggttt 
gtcaatctaa 
tgatatcatc 
ccactctggg 
gaagaagaaa 
caacactgac 
cccgtgcgcg 
ctggcgatcc 
ggtcccaggt 
cccatggctg 
gcggggacct 
ccgggagatg 
tctgcacgtg 
tggtaggagg 
acgctcaccc 
ggctccgccc 
tggcgggtac 
gcctgctgga 
tccaggcgcc 
cggtctgctc 
gacccgtcac 
ggaggacgaa 
gcggccggtc 
actcaccctt 
cctcccgtct 
agcctggtga 
ccgggacacc 
gactgtgctg 
aggaagaagg 
ccacatccat 
cgctgtaagc 
ggccaggctc 
gtggtctggg 
ggcacctgta 
ctcagcggga 
acgtcatccg 
gatagcagat 
ggtggttgct 
ccttgtgtgg 
gtaatcggac 
cctgggcaac 
gatgacccca 
ccaccatctt 
gcagacatgg 
gagaatcagg 
aggataagga 



ttgcactcca 
aaagagtgtt 
taggtttccc 
atgtttgttt 
cgatcttggc 
cccaagtagc 
atagagatgg 
acctgccttg 
tctcaggatt 
atgccatggc 
agttgggatt 
cttacgtagg 

ggggttgggt 

ggccagggtt 
aacctaccaa 
tccagaaagc 
ctggctcccc 
tttaatgggg 
aggagggagg 
cacgcctcac 
aagctcccag 
agggcatggg 
gattctgggg 
caaggccggg 
gcttctgggc 
cggctcagtc 
atcccaagga 
tcccaatggt 
ccgggcgtcc 
gccgccggga 
ttaccttcat 
cctcctgccc 
ttgcacggtc 
gctggggtcg 
ccagctgggg 
ccagggccct 
caggctcagc 
agtctggaac 
ttcacctcgg 
gagtctcatt 
acagaaggaa 
gtagccactg 
ccgggctcag 
tattagccaa 
gagcgccccc 
aggagccggc 
tgagcaccag 
gtactgaaaa 
tgggtcctgg 
gttctgagtg 
gtgttccgag 
gaattctgtc 
caagaccata 
gttgctgagc 
ggggaagtaa 
aaagtggggg 
catggcagag 
gggagacaaa 



gcctggacga 
gctatgaacg 
ttgggtatgt 
tgttttgaga 
tcactgcaac 
tgggactaca 
gttttcacca 
gcctcccaaa 
tacaaatgtt 
tgtagtgtgt 
gtctgtgggc 
gttgagggtg 
ttggacaagg 
agaagttagg 
gctgtggccg 
tctttcactc 
tcagatgcaa 
cagccctgag 
ggcgactgct 
cccttcctct 
gaggggcagg 
gaactagcct 
gtgtggacag 
cccaaggtgt 
tctggaaagc 
ccgggcctgc 
aggagggcac 
ctatccatcg 
ccgcgcggcc 
gaggtcttcg 
ctgcaggacc 
ggcctctgct 
tcttgagagg 
ccctgcaacg 
tgggaagcag 
agaccaggcc 
gtaccgtgcc 
tcccagcttc 
gccttctggg 
ctcctgcttc 
agctccaacc 
ccatcctggg 
tcatccccaa 
gcacccctcc 
cacccagccc 
gccctgcttc 
ctccaccacg 
gttcagcctc 
ggaaatgaag 
tacagccctg 
gttccttgcc 
tgcgttgacc 
ggagatgatg 
ttctgctgga 
aacttgatga 
catgggtaag 
tggtggaacc 
gacattaaga 



cagagcaaga 
ttgttgtaac 
gctgggtata 
cagagtgtcc 
ctctgccccc 
ggtgcccgcc 
ttggccaggc 
gtggtgggat 
ccacattact 
tggccagggt 
agggttggag 
agtgtgtggc 
ttggagaaca 
ctgtcatcct 
cacaggaccc 
aaggcggggg 
aaagagaagg 
ccgtaacctt 
acgggcacat 
ccagctgctg 
ggagcccctc 
ggatggagac 
agccaaggga 
tggacactgc 
ggtccattct 
gctcctagag 
tcgggcccca 
gtgggtgggt 
cgggccgccg 
atccgcacat 
taagccgcac 
ggccgcgagc 
ccaacagctc 
cccggttata 
cttagcggag 
ttcccttctc 
ccccaacccc 
tttacctgag 
atacacctga 
cccctcccta 
gggagggaga 
gage c age tc 
ggagccccat 
cccacactta 
ccagagcagt 
ctgcgcctct 
cctggggcgc 
ttcagtacct 
cgaggggatg 
ggggagtgaa 
tccttcctaa 
ctccccaccc 
gggctgeaca 
gagagttggc 
cctctgcaac 
gaaggaggag 
caatggaggg 
ttagtgatag 



ctccatttta 
atgtctccca 
tgcctaggaa 
ttctgttgcc 
cagcttcaag 
accacgcctg 
tggtcttgaa 
tacaggectg 
ggtagtcaga 
tgggagtaaa 
tegtacagag 
cacatttggg 
ctgtgagggt 
gtgeaaaace 
agccacccac 

tggggagatg 
caggcacagg 
caaaccttcc 
cgtcgatgtc 
cggtctccgt 
agcggcccgg 
gccgccgtcc 
ccgcccccca 
tttgggggac 
cctgacccgg 
ttcctgtccc 
geaggctegt 
ccggcgcggc 
ctcaccgctg 
tcttgagctg 
cgcgcaggcg 
ccccacccgc 
ctgctccttt 
cgctggaggc 
gcttggacct 
cccctctagc 
agctgtgtgc 
ctgaccccct 
gacaaggcaa 
cttccactct 
aatggaaaag 
ctgaaagaac 
gtcccttcca 
tcacccaggc 
gggagaagct 
gcctctcctc 
actgegegat 
tcctgagggg 
ggggagaggg 
agcgcatcgc 
tagaagagtc 
tgccaccaaa 
ggtattctgc 
ggggacataa 
aaggactgag 
gtgagagaag 
gtggagatgc 
gtagaggaga 
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ctaagtctcc 
atactgagtc 
taatcccaac 
cagcctggcc 

ggtggtgcat 

gaggtggagg 
tgagactctg 
ttcaggagcc 
cacacacaca 
catgtgataa 
aaacacatgc 
agccacatat 
acatgaaaag 
acacaaacac 
actaaacatg 
catagcaatg 
tcattaatgt 
atgtagacat 
acacacatct 
aaacacaaag 
acacacaagg 
acagagggat 
tcgccattgg 
acgacccttc 
ctaatccaga 
accattcccg 
gctacctcct 
agaaagcatc 
cagccaaagc 
tcgacggcca 



agcatacctc 
tctggtcctc 
actttgggag 
aacatggtga 
gtaaacccag 
ttgcagtgag 
tctcaaaaaa 
cagggaattc 
ctccaacacc 
gatcccccta 
tagacacaca 
gcacacagcc 
atgacagata 
cttcacaatc 
tcaacacata 
agtgagcaca 
ccctggaggg 
acatatatgg 
aaaatttgaa 
agtttgcaca 
gcacacacct 
gttgtctacc 
cgtcctcacg 
aggcgcttcc 
gactcacgtc 
actctcctgg 
ctcaggcccg 
acttctgtgc 
caaccaggac 



agaggagctg 
cagtagagaa 
gctgaggcgg 
aaccccatct 
ctacttgggt 
ctgagatcat 
aaaaaaaaaa 
acatggatgc 
aaagttccag 
cacatagata 
taggcacaca 
tccccaaata 
agtatttctg 
acaaagaaac 
tgaacacaca 
tacagaaatg 
agttagaggc 
acacacatcg 
gtcccaggag 
gagacccgtt 
ccatcgctaa 
cacaggtaca 
gcctggccgc 
tttctcgcca 
ccagctggga 
gtacaaagac 
agctttgaaa 
ccacgccacc 
cccagaattc 



catcccttcc 
tgagtcagcc 
gtagatcacc 
ctactaaaaa 
ggctgaggca 
gccactgcac 
aaaaaagaga 
atgatgcaca 
gagagtaaaa 
catagggcag 
tgtagaacac 
tacagataca 
tcattgagat 
acatgcacac 
catgctcaga 
cataaataca 
acccccccca 
ttcacacaca 
gcagattcct 
acacacgaac 
agtcccggga 
gctggtgcag 
cccagcggtg 
ctgcagaagc 
gcggccatat 
ttctgggagt 
aggcggttag 
tagtgaccag 
cgcggatcct 



ccactaattt 
agtgcagggg 
tgaggccagg 
tacaaaagtt 
gaagaatcgc 
tccagcctgg 
atgagtctcc 
cacacacaca 
gactcacgta 
aatgtgtaga 
acatagacag 
gagatgcata 
cccaggagac 
aagacccccc 
ctcccagaag 
tatgcagaca 
acatgcacac 
gatgcaaata 
ccccactccc 
atacaggaca 
gaggtttcgc 
cgcctcctcg 
ccgggtcccg 
ccataactgc 
tgttttctga 
tgtagtttta 
ctgcccttgt 
ccactcatcc 
tctatagtgt 



aggaatggga 
ctcacacctg 
agttcgagac 
agccaggtgt 
ttgaactcag 
gcgacaagag 
aagagaagct 
cacacacaca 
agcagagaca 
cacatagaac 
aagcacaaaa 
gaagtacttg 
attcctagaa 
caaatacaag 
acatagaaca 
catatcttaa 
aggatagaac 
gaaaacaagc 
acaagcacac 
gaacatgcag 
ttgggccggg 
tccacgctgc 
ccccagcctc 
ctctgcctgc 
aaccatggaa 
tcccctcccc 
cttcttccca 
attttgggac 
cacctaaatg 



205740 
205800 
205860 
205920 
205980 
206040 
206100 
206160 
206220 
206280 
206340 
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206520 
206580 
206640 
206700 
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206880 
206940 
207000 
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207360 
207420 
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<210> 6 

<211> 23574 

<212> DNA 

<213> Homo sapiens 



<400> 6 

gctggacctt caacagagaa ggatggctgg 
ctagtgcctc tggagcagga gcacagtgaa 
aggagccttt gctgcagcaa ctccccgccc 
caggtaaacc tgggagccac gggattcggc 
actagccctg ccacgcacac ccagcaacac 
tcccatgcta aactaggaac gctatgaaac 
ctattttaaa cttaaaaaaa tttttttgaa 
ctggtcttga actcctgggc tcaaaccatc 
tgacaggtgt gagccaccat gcctggccta 
gacttgcaga aaagttggaa aatgtaagca 
tgatggtcca tcttacttaa tcatagtcca 
caatcctgtt aacgaaactg tagactgttt 
tttctgttcc aagatccatt ctaggatccc 
tctgggagtt ccttcatctt tccttatctt 
agttattttg cagaatgttt ctcttgagtt 
cacagcattg ttttgactaa ccaaaaagtt 
ctcatccaac ctgggcaaca gagtaaggcc 



cagagtccta gtctgatagg cgcccccttc 60 
aacccagctg tctgcgctga tacccctggg 120 
tcattaagtc tccaccccag gctgcaccgt 18 0 
gcctgaatgt gcctagacgg accccaacac 24 0 
acagacaacc acccagggag acctggtctc 3 00 
agcatttttc ttttcctttt ttatataatt 3 60 
tagagttggg gtctcgctct attgcgcagg 42 0 
ctcccacctc ggcctctcaa agtgctagga 480 
ttttaaactt cttattgttg actaatttta 540 
ttttccttac acccttcacc cagcttcccc 600 
atcattacag ctaggaaact aaccttggta 660 
tgcatttctc atttttccac taatgtcctt 720 
acatcacagt tgtctcctta ttctcctcaa 780 
tacccttgac acttttgaag aatcctggcc 84 0 
gtctgatgtt ttattctgat cagaatgaga 900 
attctataag taaatattga ggttaaaaat 960 
ctgtctcaaa taaagtctca acactaagat 1020 
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ttaaaaagtg accagaaaag cccccactat 
acaaacaaac aaacaaacaa acaaaaacga 
gtgcagtggc acgatcttgg ctcactgcaa 
tgcctcagcc tcccaggtag ctgggattac 
tgtattttta gtagagacaa ggtttcacca 
ctcaagtgac ccgcccacct ctgcctccca 
cgcttggcct cattagcatc ttaaatctcc 
gagcaaagga tcagtttgag gacaggtaaa 
agtccctgct gaagatagct ttgcttgaat 
tgttgactgt acggtcacca cagttgctgc 
ctacctatcc tgtctcagta catctgtctg 
ttaatgaatc agaagactgt gatgtgcttt 
aatgtaaacc aagatgcccc tgcagtggtt 
aaggggtcca gattcagacc ccaggagagg 
agacgagtcc ataaagtgaa agcacattta 
actccataga gagcgcagcc ctgagggctg 
at gat ct get aaacaggggt ggattgttca 
cttcctgata ctgccatggc atctgtaacc 
gggaccgacc agaggtcact ctcatcacca 
tatattgeaa gctgattttt ttggttggtt 
tggagtgcag tgacatgatc tcagctcact 
ctcctgcctc agcctcccaa gtagctggga 
tttttgtatt tttagtagag atgaggtttc 
tgacctcagg tgatccgccc acctcagcct 
accgcgcctg gcctactgca aactatttta 
atctcatcct gtgaegcaga atgctgtaac 
ccttattttg ctcagcccct attcaggatg 
atcctcttgt gttttggcgt gtgggaggaa 
agctctgacc acacagggca ggtacactct 
ggaggggcag agagagaaga aggtgtgaca 
agcccagttc ctcccggggc ageccaagta 
tcacagctga gctgggctcg ctcgctaccc 
cctcctgctc agectcaggg tccatgccta 
ctcctgcttt tctgctgttg gccccatgcc 
gttctgeage acagecaggt gatcctgggc 
ctcccctctt agagccctct ttccttccca 
gagaaggggt ggagggggtg ctgggcagtc 
ctccccaggg tcccttgtag ctggagtccc 
aagagcgggg gaggtatatt aacgtgtatt 
tttgagaegg agtcttgetc ttgttgccca 
actgcaacct ccacctccca tgttcaaaca 
ggattatagg cacgtgccac cacaccgggc 
ttcaccacat tggecagget ggtctcgaac 
cctcccaaag tgctgggatt acaggcgtga 
ttatgatggt ttaacatctc ggggtggggg 
ctggagatca gaaacaactc atgectttea 
accaaccacc cccttcaagg aactctcaca 
cagctcaggg ccaggcacct acccagacaa 
ccagaatgat teaaaatgee aatcctaccc 
ggaaaeggea ctgtgggctg tggcctgtgc 
ccaaacctag tgcctcccca ctgtggccct 
tttcaacagc attggectet gtgtcatcag 
caactgaggc aggagggggc tttagagagc 
tggagecega ggtgaagttg cctccctgca 
aacagctggc tgatataggc cactcagcag 
cgcccaaggg aggcaggagt gacagagcaa 
tgtgcccaga cacatgcctg gggctctggc 
ccccgtcagc caccacccca ccaaccacca 
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gatttgtctt gacttttttt taaaaaacaa 1080 
tgtcttgetc tctctctccc tcaggctgga 114 0 
cctccgcctc ccaggttcaa gegattctte 12 00 
agacgcccgc caccatgccc ggctaatttt 12 60 
tgttggccag gctggtctcg aacttctaac 132 0 
aagtgctgga attacagacg tgagcaactg 1380 
acacaggggt gtgttcctta ctgttataag 144 0 
ataaaaatgc gcttgctgcc tagagggaga 1500 
gagctcaatt geaatgecag tgetgagget 1560 
tgcgcgccta gaacatggtc actttcttga 1620 
tggtttgtgg tggtccattt cctaattttt 1680 
ccgctgtgct aaccatggcc gctgaagcaa 174 0 
gtgettcact ctacgacatc tgttaccgga 1800 
gttcttggat ctcgtgcaag aaagaatttg 1860 
ttaagaaagt aaaagaataa aagaatggct 192 0 
ctggttgccc atttttatgg ttgtttctgg 1980 
tgtctcccct ttttagacca tatatggtaa 2040 
tgtcatggtg ctggtgggag tgtagcagtg 2100 
tcttggtttg ggtgggtttt agccagcttc 2160 
tggtttttga gaeggagtet cgctccaggc 222 0 
gcaacctcca cctcctgggt teaagegatt 2280 
ttacaggcac acaccaccat gectggctag 234 0 
accttgttgg ccaggctggt ctcgaacttc 24 00 
cccaaagtgc tgggattaca ggcgtgagct 24 60 
tcagcaaggt ctttatgacc tgtatctcct 252 0 
tgtctggaaa cgcagcccag taggtctcag 2580 
gagttgetet ggttcacagg cctctgacac 2640 
agaggggtga gggaaggaaa ctcaaaacca 2700 
cccacctgtc tgtgggtgcc acaagtcaag 2760 
gatggecgea ggccacagaa tgtcagagga 282 0 
gctggtagtt gggtggccaa acagagggcg 2880 
ccagctcagc. gtccactctg cccctcagta 2940 
ccctcctgct tcccagtcac ttctgcgtgc 300 0 
agctcctttc tgetgagett ctcttctcca 3060 
tccagacagg cctctccccc agtctggggc 3120 
cgtggcctcc ccagggttcg ccactgaatg 3180 
ttggggatta gecaagaggg cagagttggc 324 0 
geggggtcta gacaccccct cctgaagggt 33 0 0 
tttagagtct ctcttttttt tttttttttt 3 360 
ggctggagtg cagtggcacg atctcagctc 3420 
attctcctgc cgcagcctcc cgagtagctg 3480 
taatttttgt atttttagta gagatggggt 354 0 
ttctgacctc aaatgatcct cccgccttgg 3600 
gccaccatgc ctggccttag attctctatt 3 660 
cttgttggct ggagagaaac tgettgatte 3720 
tatgeaaace gaccagtctt gagtccatac 3780 
tacgaaacca gtatttcccc tgccctaaac 384 0 
ttagagecca accccacgcc ccaaacccca 3 900 
tcttcccctg gcctgccttg cctccccagt 3960 
cttccactcg ctcctgattc tgtccctgga 402 0 
gcatggtggg aaactgtgag taataactta 4 080 
tcaccttcat aaattaaaat tttgeaacta 414 0 
actctcacct ggccccttgg gcttgtggag 42 00 
ctcagttttg ggatggtttt gttatgegge 4260 
etcttgeatg aagcaatggg agaatgtgaa 432 0 
agagggtgtt caaactaggc aacaccgtcc 43 80 
tgecatctag gctcggcctg ccagccacca 444 0 
ccccgccaac caccaccccg tgtggtcctc 4500 
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agggcacccc atgggttgaa tttacataca 
gtctttttaa aatttatttt agagacaggg 
tggtgccacc atagctcact gcagtgttga 
cctcctgagc agctaggcct acaggtacac 
tctgtagatg cggfcgtcttg ctgtgttgcc 
gatcctccta ccccggcttc ccaaagtgct 
agccaacatg tctttctttt gacttctact 
fcccccccgac acacacagaa tgggggagag 
gtgtagcact gggattgggg gtgggggctc 
gaagaaacag gaaggaaggt ccccaaaatt 
aggggagtgt ggactcagtc gaaccatagg 
tcatgccctc cacagagctg ggtccgtgga 
aggacagagc ccttttggga tggccagcac 
gggctgtgtg acctttgctt ctgggactga 
agtgttctcc agccctcagg aacttctaat 
gaagctccac tcggggaaga aacttccaag 
cactggctct gcccctgcag ttcaagttcc 
aaatctgttc attttaggag ctacctctca 
atctgctgag aatgaggagg atatgttgtc 
tgtgtggcag agagcccatc ctcactcagt 
ttctccctcc ccagcccagg ggcttccagc 
taaactcatt tccataataa cctgagccca 
ggtgggagag gtgggagggt gggcgatgag 
taccttgggg gtcaggcgag actgctggca 
gagagttctc agggtctggg agaaatggtg 
gagaccctgg ggctggggag gggcataggg 
gaaggaggca ggaggcatga gcccttccct 
cctcactcac ccaggggcca gggctgtcca 
acgccgccca gggggtggtc cctgtgtggg 
gtccaggagg ttggcacagt gatcctgagt 
agcccaggga gttctgcgtt tactgagttt 
tctcagctcc cttcctgctc cctcttcagg 
ctcatgccct cggctggcac ctcctctctc 
ccacccagca gcctggcact ctccagatgc 
atgtccgccc cttcggcagc tatgtctgct 
gatgattcca agccatcttt gactgtcccc 
gtgatgttcc ttcaaaacat tctcccctgc 
gccgccaccc tctgcagcgc cccctccgtg 
ctctgcccac agaccctcaa tcacaactcg 
gggcccaggt ggtgccctgc ccctgtctgt 
tcatgccatc tcccttggct tgcccggagc 
ccacagagtg cctgccctgc ctagggccct 
ttagtattat gtgactcccc actccgccac 
agggccctgg cacaggggca gccagcaccg 
gtgctgggcc cagcatttgg gaacttcaag 
gtggcaaagg gccgaaaact ggtgagacag 
tacagggact gcaatctgag ccagggaaaa 
ctggtgggag aagcaggaga gtggacaaga 
cttgcggtgc tggtggtggg gcagggatcc 
acctcctgtt caaagctact tctttccatg 
caaggctgag ggcctgcccc agctcccaca 
caggagtagg ctcaggaagc aggcgctcgg 
gcagccccag ctgcatcgct gcagatgggc 
gccggcccct gggagcagag gcagcaggac 
atggtctgcg gggattgggg gaaggggcgc 
cctgcccaca tgccccccac tggcatactt 
ccaccaaagc ctggcttgtc acagaagggt 
tggttgctat tgcaaacctg cagagaagag 
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gaaaagacta accaaggtcc aagttaatat 4560 
tcttgctctg tcacccaggc tggagtgcag 4620 
actcaggctc aagtgatcct cctgcttcag 4680 
accaccacgc ccagctaatt ttaaaatttt 474 0 
caggctggtg tggacctgct ggcctctggt 4800 
gggattatag gcatgagcca ccacgcctat 4860 
ttggtatctt ttcttaaatg gttccctctg 4920 
gctgtcagat tctgagctcc agaacctcag 4980 
aggaaccacc taggggagaa gacagggtgg 5040 
atgtttgttt gcagaggcca gccaggctgc 5100 
gccccaggac cactagcttc tggccagcag 5160 
aattgcatgt aggagacaca ccagactccc 5220 
tacccagcct ccactggtga gggaggtcag 5280 
tggtttattg agctggagag tgtgcccagc 534 0 
gtggctctgg gttcctggag tgggtgggtc 5400 
ctgcctgcag gtgctggagg tccggtgatt 54 60 
tggagtggct gtcagtggcc acctgtcttt 5520 
ccagaggcag gatcttggca tctggacttg 5580 
ccctaaggac tggggcccca ggctgcaagc 564 0 
gaggaccagt gatccaggaa aagccacagc 5700 
atcctggtct ccatgataac caagaggtca 5760 
gaaacctgat tagggggcag caaactgagg 582 0 
agggggaggc tttgaatcca ggtccctgcc 5880 
gaggcttctc agggtggctg ctgggctcat 594 0 
gagggtaaat gttgtgaata tggtcagcag 6000 
gactcaaggt gactgggtgc tgcccatctg 6060 
tctcccttcc ctctccacct ccccctggtg 6120 
gtggctgtgg ggcccaactc catggggtga 6180 
ccatctttgg ggctgagcaa cgtgataaga 624 0 
gggttattgc ctccccgcag catggtgtcc 63 00 
cttggggcac ccatctgctc caagtcaccc 6360 
ggagccttgg gatccaggct ccaagtgagc 642 0 
tagtcctaac atttcctcca ggctctgaca 648 0 
tggcatcgct cagcttccaa agaaccttgg 654 0 
ctccttgccc ctgggtgccc tgctgccctt 6600 
atcccatccc ccaaggcctt gtcatttcct 6660 
cctgagactc ccgcctgggg atgagaagca 672 0 
ctgacacgcc aggctctggc caccttgctc 6780 
ctttgtcagg gtcctcttag ctgccacccg 6840 
tatgccctct gcccccatct ctggcccaaa 6900 
actcccaaga ccaggctatg tcagacatgg 6960 
ggtgcagggt gagtcctagg acagccatgc 7020 
cacccaggtc acagagaact gggttaaggc 7080 
cagctgacca gtggtatgga gtgaaaagat 7140 
ggggtgacag aggtgatttg tgcagaggaa 7200 
aggctggaca ggcctccggg ggcagcatgg 7260 
acagggcgaa gtcaagggtg aggcagccag 7320 
ggagctgtac tgggaggtag agggccatgc 7380 
accccctcgc ttgactggga ggccactgga 7440 
gcctctgggg ctgctctctg cacctggggg 7500 
gccccagcag agctctgagg aggggaaccg 756 0 
agcctaccca ctgcacgcag ggtcccttct 7620 
tcctgggagt cggtagcaac accaggccag 7680 
gctgaggagc atggccagca ggaaggtgtc 774 0 
tgagtcctga gcaggtgcac caccccagct 7800 
tcagcctgca cagggccact gtccatgctg 78 60 
ggagcccagc ctggagcaca gtggcagtta 7920 
aagaggaggg tcacgtagga ttaggaaccc 7980 
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caaggtcacc cccactcctc gggctctcac 
ctgaagctcc tggaaggcat tcttcctgca 
caccaggcag tgagggggac actggcctgc 
ggcagagctc accattctag gtccacactg 
caggtccagc tgggcactgg ggagtgccaa 
taggtgaacg gtagagtcca ctggcaccat 
ctggcactgc agcttcccac acagggcatc 
ctgcagtacc ccccttcccc aaacccactc 
ctccctacct ccctgcacag ggcaggaagt 
gagcatctcc cgcagagttc accacctgga 
aggaagcatg ggccaggctg ggggcagctc 
ttcccctgcc catcttcccc acagtagaaa 
tgcaagggtg ctcgtgtcct ctcaccaggc 
gtgggacatg cgccatccca gcagtagcca 
aggtaaacgt ctgggggaca gtgggaggag 
tcaoccatgg cctggcggca cagcgctcca 
tggggaaggc agagagaggc cacgtgcagt 
gctgggggag ccaggcctac ccaagcccag 
taactaaggc caacaggccg gctcccaagc 
gccctcacca ggcagcgcac gcagcagtcc 
gagcagttgt gagcaaagca gcagaggtcg 
ggcccaggtg agggcgcagc gccccagacc 
agccgactta acctggccag ggccgcagtc 
cccgcagagc gccggcggca ccgggagtcc 
gccccccttg cggaagaagg cgcgcagctg 
aaacgggtgc ctaccggcac ggggagggca 
cgcgcttctc tgatccccac ccctgggctt 
agaggggaag taacccgcgc aaagtcacac 
aatagtgagc agcccgggac ccaaggtgga 
aggagtaacc tcgccaggtt actcggaaaa 
cctgagcttc ttccctttaa gcctcataaa 
ccccatttta cagaagagga aactgaggga 
caggactagc cgccccgccg cagccccgac 
atgacgcagc ctccggactc ggccgcagcc 
aggccgaggc tgtggccgat ctcatgggcc 
gagtggtcct ggggggccgt gggagggcgg 
cctcgccccc gcccgcgggg ctcaccgtgc 
tgccctcgac gggcgccagg cccactgtgg 
agcgcggcgt gaccaggcgg ggccgggagg 
ggtcagaggc acccacgtga gcagctgcgc 
ccggcgccac tgcaggaagg cccagagcgt 
gtcccgctcg gtccacacct ccaggccggt 
aagctgaggg cgaggcgggg ctgaagccgg 
tcgtggggcg cccttcctct tccccaaacc 
acccccatca ccgctctctc cccgccgccc 
aggagacgct gtttggtgtg gttcaagttt 
aatcccggtg gcttgagggg ctgagctggc 
ctctcctcac cagggtgtgg tctgccacaa 
gcgcttctcg cctgccctgc ggaggtgcaa 
tgagccctga cccccaaccc cagctcccag 
tttcttcaga cttcaataaa aatactggga 
atttggaggg ctatagatac agcattccca 
tctggggacc accaggaagg ctggtcatgc 
aggttccttt ccaggtgagc agctgctcca 
tggagccccg gggtggccag ggacgcagat 
ggccactagg gtgcagaggg gtaggagcgg 
ccagcctcct ctcccagagc tcacctcatc 
tcggggaagc cccttactcg cccttggtag 
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cccgtggctg tggcaggcag tcaggcagcg 8040 
gcgcctgctc tggcacacct acgggcagtg 8100 
gggattcaaa cggcaaggag gggtcgggtg 8160 
ggtgcctggc tctaccaggc ccaggccaag 8220 
ggctccccga caagtcactt cctggccatc 8280 
gtgcggtgcg agcaggctgg gctttccacc 8340 
cctggggagg aagtagaggg gggtcaacag 8400 
catagcttct gctccctcca ctcagctcca 8460 
ggccctcgct gtcctggccg cagtttccat 852 0 
aacaggcctc gggagctggg tgggagcctg 8580 
ggagaggggc tgcgctcagc gggcctcagt 8640 
actggcccca ccagaggatg gggggcaggg 8700 
ccccagagct gctggcactg ctgctccagc 8760 
ctgcccctgg cacagggtga gccgtccagt 882 0 
gtgcccgtgc aaaactcagg gaggtcacag 8880 
gccggcttca gctgcgcagg tgacgggtgg 8940 
gagaggtcca tgccgagagc gcggctcgga 9000 
cacccaacgg gggaacctga gggcaccaat 9060 
tccccgaaac cctcaccctg aaccttccat 9120 
ccgtgggcgc actgggcccc cgggcgcagc 9180 
cggcactcct gggaccagaa aggcaagaag 92 40 
tgagcggaga gggcaagtgg gggccgggcg 93 00 
acactcctcg cccgcttcca cgaagccgtt 93 60 
ggggtccggg gcattggaga ggcaagcgcc 942 0 
gcggcggctg caggcgctga acacgcgcgg 948 0 
ttgggcatgg agggacagtc ccccaacccc 954 0 
ggctacagcc gccagacgcg cagagcccag 9600 
aacaagcggg acaggggacg atgcggcccc 9660 
atcgcgaccc gacggtgctc ctcccggtgt 972 0 
taatcttcat accgttgaga atccactttg 9780 
ccaccctgaa gcggacacta tgatcattat 9840 
cgaccaaaga aacgcagcgg aggaagtccc 9900 
cccccacccg cgtacccggt ggccgcagcc 9960 
tccacgcagc agccgtcggg gtcgtggctg 10020 
atggtggctg cggcgccgat ggggagctcc 10080 
tcactgcggc cgtagagcct cctgtctctc 1014 0 
tcacgcctcc cgagctctcg gcgcggcaca 10200 
cgccctggaa ggcgcggccc ctgggggcgg 10260 
tgaggccgcc ccacccggga cccgcgtccg 10320 
ggagtcgtgg ggccgctgcg cccacagccc 10380 
ggcgttggcg tcctgcgtga cgcggctgcg 10440 
cagcgccacc tgaatgtcca gagtcctgag 1050 0 
gacagggcgc cccatcgcgc cggtggtcct 10560 
ccaccagcac ctgcctgtcc tgccgccgcc 1062 0 
ccaacctggt ccacgtagtt ggcgacttcc 10680 
cggtgccgag tcaagaactg ggaaggcaga 1.074 0 
cccatccctg accccgccaa cccctggggt 10800 
tgtacagttc caggtacttc cgggtcctgc 10860 
atggggaccc tgagtggaag ctgctgggct 10920 
aaggaagttt aacatgtttt ctggaacttg 10980 
ctcgaggcct gtgaattcct gtctcttctg 11040 
ctcccatccg atcgatgccc ctgaccctgc 11100 
ccgctttgtt cccaggatcc ctgtggccac 11160 
tccgaaagat ctcgtgggtt gagaagtcct 1122 0 
aatagctggc attcctgctg agggtgatca 11280 
gtgtgaggga gctctttccc catcccaggc 11340 
ccagagcagg tgcagaggac tacccaggag 11400 
tggcaatgat cctagggagg aaggggccag 11460 
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ccccaaatct cagccagggc tggagcaaga 
gctccaaccg ccccttagga atgcaaggag 
tctggcggtg catcccagag cccatggaag 
actggctgcc catctgggcc gtagtgggtt 
agagggggtg ttacagggaa cactgaattc 
ccagccttcc tccctaaatg ctaatggagc 
cttagggaag ggacaggagc aatggtgact 
gaaggagcct tggtgatggc ttagttgtgg 
acgccgggga ggagtgggaa taggggaaga 
cagcaggatc cacaggatcg ggaggggagg 
tggagttggg gaaatgcggt gacctccccc 
ggcatctggt cgaccctctt gcccccacct 
cttctagacc ccacatacct cccacctgtc 
cagagaaaac agaatcgcca cctgcccacg 
ccactccggt aatgattcca tcttcaggct 
ggatcaatca taagtcaatc • tgtcttcttt 
gcctcattac ctccagacca cccgctaatg 
ccacctgccc cagctctgcc ccctacccct 
aggtgatgtg accaccccat tgaaagggta 
tggaactaga gtttgttccc tgtccatccg 
agtgctcaga acagcgcctg cggtcaagca 
tctgggtggt tttaagagcc tgcgctccag 
accttcctga gcacaggtct gcctgtccct 
gtccaggaca ctgagcacca aaatggcctt 
attcattccc ccctgctccc ctccctgtag 
tgctgccccc aagtgtagcc actgcatctt 
tctttttctt ttttttctct ttttttgaga 
ggtgcagtgg tgtgatcttg gctcaccaca 
ccgtctcagc cccctaagta gctgggatta 
ttgtattttt agtagagacg gggtttcacc 
cctcaggtga gccgcccgcc tcggcctccc 
ccttagcctg gggtgccggc tcatcttttc 
ttgcaaatgt ccactaggtg gtgctcactc 
tggggttggg agggcggagg tgccccacag 
gcctccaggg agccaaccac aggcctgcgt 
ggtcctctgc ccatccagcc acccgcacag 
atcccctcct cccagcccta ccccagcctg 
ggagctcctg gccttcagcc tccagggcca 
caagaaagag tgtggagctg agatggtggc 
tggcctgtgg agccggctgg ggaggaagtt 
aggacccccc acatactatg gagggcgtgg 
aactcaggat gtttgcagtc ctgacaacag 
aggcacacca gcatcctccc tagagaccag 
ccgccctctt ctttcatgcc cagaaggggc 
gaggacatta tgggggcatc gggtagccat 
tgaggcttgg ttccaaatga gctgctgctc 
cacttccaat tttggcaaac tgctccctct 
ccgtccctcc ggctccgaca cctgcttcac 
ccagccaaga gagcacactg gctctcagct 
ccctctctcc ttcccccacc ctcttggcgt 
ccttgtccac tcagcctggc tgggggccca 
aggctgctgc gccttcggaa tgtttaaccc 
ctggggccct gtccaagggt ctcagagtct 
agcagatggt ccttgcctcc ctggccacag 
aggctggaat gccacttgct ttattgggga 
gaggtccctg ttcctctgca gcccctgcta 
ccttgatata tacccctcat cttgaaatgt 
tagcctctac tctttcttct gtaccatctt 
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ggggcaagag ggagggtgtg gtaggggctg 1152 0 

gagtaggggt aggaatggtg ggggggtacc 11580 

catctcaccg tgtggttggg ggccagcacc 1164 0 

tctatgtatc ctggggccag cagcctgctg 11700 

agcttcctcc tgcctcctcc aggatgtctc 11760 

agctttatga gtgagacact cacagtgtgt 11820 

tgctcagatc agaaactctt ggggctagag 11880 

gagatgtgaa tatgggaagc accagggagg 1194 0 

gtttgtggtt cccaggggac ctgcagcagg 12000 

agtcaggaga cactgccgaa gaatgggact 12060 

agttcccctg cctgctgccc tcctttg'ttg 12120 

gccctagatc cttgaaatat tttcctcaga 12180 

cttcagtgat tgatgctcac cccctgcctc 1224 0 

ctgcttccac cctccctgcc ttctccacac 12300 

ccatctcaac aggatctttc ccacacggat 12360 

aaagaaaatc cttaacccaa cctcaccttg 12420 

atggctgctt cccccctccc aggcattcca 12480 

gccccacaca cacccgccac cctaggaggt 12540 

gggacgtcgg gaaaatatgg ttgggcacag 12600 

actccacgag ggagaataaa atacgtgtca 12660 

ctcagtaggt gatatatact gataacataa 12720 

cccggacacc caccccaccc agcccaaggc 12780 

tccccagctc aaaatctttg gctctggatg 1284 0 

ctgtgatctg gccctcctgg gactctccaa 1290 0 

atggagacct tccaggcagg tcagacaaac 12960 

tttctttttt cttttctttc tttttttttt 13020 

cggagtctca ctctttgccc aggccggagt 1308 0 

acctctacct ccggggttca agcgattctc 1314 0 

caggcgccgc caccacgcct ggctaatttt 13200 

atgttggcaa ggctggtctc aaactcctga 13260 

aaagccaccg catcttggtc cctgccattc 13320 

cctctaggat ttctttagac tcagcatatc 133 80 

atcgccagca gggagctaac aagccgctcc 1344 0 

cggggctgac agcctcagcg gtcctcttca 13500 

gactctccct gtcatctgca ccctctctgg 13560 

atctgtgtca gtccctgccc cccaacactg 13620 

gcactcactg gttcttctcc agctcaagca 1368 0 

ccagccccat gtctggcttc gagacctggg 1374 0 

ctccaggcct cctgcctgcc agggagtagg 13800 

cttggggaga acgtgggctg gggagtcagc 13860 

aggaggtgag aacatacaaa gatgttccca 1392 0 

ccacttggaa gggcgttggc acagcctgcc 13980 

aggtcccaga aaggtgcccc tcccctggcc 14 04 0 

atcaaaagca ggggaagaca gaggggtgct 14100 

ggtcagggcc tcctcagagc ctctgctacc 14160 

atttcctata gaattcaaat ttgactcctc 1422 0 

tccaaagttt tcctgggcct ccagcagccc 142 8 0 

tggacccacg aagtaaacat ggacgccatt 1434 0 

aggtgtcagg aggctggctt ggacggccag 1440 0 

ctcccaccct gtgggaacac cccacttccc 14460 

gagttggagc cggcccagga gcttcctggg 1452 0 

ccgactcctt ttctccaaaa atgcactggc 1458 0 

ttggagggag ttcttccttc gcaagtgggg 14 64 0 

gccccacaag gcctccagca tgagctcatg 1470 0 

aaggtctgca ccgggaaaaa ggccatactc 14760 

tctttactct tgccctcctg gtaccctgcc 14820 

gagtgtttcc tgccttttgg aggggatacc 14880 

ggcaggcttc ctgggggcag gggcccaccg 1494 0 
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gtgggggaag cagagcccct ttggggctct 
agggaggccc agaggcagag tgaccccagt 
ggagggcaat ctcaaagctc aggccagtgc 
gggcctaggg gatcccagtg agggccctgg 
tcagccttca tggcaatgct cccctgcctt 
aaaattccaa acaaagaggg ctctctagga 
cgttcagacc ttagtgggtc tcaggctccc 
ttccccaagc tctctgggct tccgtttgcc 
gccctcttag tttgcagcac agatattatg 
aagcgtgtga agagcgggat tgggaagagg 
atggggtggt cttacatctg ggggacatca 
aagtggattc cacaggagcc agtgaggctg 
gagggacagc agtgtgtcca cagggtcaca 
ttttggtgct tttttttttt tttttttttt 
aaactctgga cccatttctg cctggtgggg 
•agggaaagag ggaggtgggg tggagaaagc 
agatccagat gtccaaaccc cagtgccttc 
gggcctatag gagtagtgac ttggtggttc 
tgtagggaga gacaaaaggg ctgggaggtt 
gcactgaccg gctcctccag gctgacggtg 
ggggtgactg gctgcccagg gatatgtcct 
acctgaggaa cacaggggca tgggacaaag 
ggaggcactg gaggcctttt ggggcagact 
agaaggaggg acgcccgtgg gcatccttgg 
agaaggctgg aaagggacag ctcagctggg 
ggcaggcttg ggctcggcga tcacaaggaa 
gaaagagcgc ggcagcctta gggattttta 
atgtacgaag agaggacact tgtgcccccc 
ttgccttgcg aggtccacct tgcccagagc 
ttttggtacc ctggcccaga aggcccctcc 
gacggagcct tggtgctccc tggccgggtg 
agccaagcat tactcagagg gcgcctggaa 
ctggccccaa ccactctcac aggcccgccc 
ctgcagcagc cctccttccc ccggtaccca 
cctccacgca gccctggcct cagtcctgct 
gctcctgcta ggccacccca actccttttt 
catcaaagac ccttcctcca agaagccctc 
agatgagggc cgagaggagt ctgccacctt 
agggctctga gtggatgtgg gggaggggaa 
ttaactgaga gtagagaacc aagctttgct 
ggtagggtgg gcccgggcac aggtgtggtg 
acatggcctt gcgcgcacgt gctggccacg 
gcagagctgg agctggagct ggagctccgg 
gagccaggct ggtattgggc acatagacgc 
gaccccagga tagggctcag aggggaggga 
gccagaacag accaggcccc tgtacctgtt 
gcccagtctg catagcacag cccagccccg 
cccggccagg cacttgctgt gcaggactgg 
tccagcccca tcttaacatc tttgtttgga 
ttccagagca ctgctaaaca gacaccttct 
acaaggcaac aatataaaca acatacacac 
ggtgagcagg gtggaggcca cagctagttc 
gggactcttg ggcatggggg cttcctagag 
aggagcagga gcgggtcaca cggtggcctg 
ccaggcagag ggggctacgg ctatgggctg 
gagggagcca tgccctctgc tttgccagcg 
gggcccgggg ccagcccaag cacacttgag 
ccaaagagtc cccacgccct gacctactgg 



71/154 

cctcttggtc acagcccagg ccagacagac 15000 
gtgtgtccag ccttcccctc ctggggatgg 15060 
cgtgcttgac cagtggaatg ggggccttat 1512 0 
gttgggagct gctgggtctc tgggggcctc 15180 
ccctcttgct ggatttggac agtagggctg 1524 0 
ggggcagggg tgtagccaat ggtttaaaat 153 00 
agcctaaaga gctgtgtgac catggacaat 15360 
cctctgtaaa atgagcatat caaggctact 1542 0 
gcacaaacag atggggcatg gttattctgg 1548 0 
ctggggcaga gcgtcctgca gaagaagcac 1554 0 
ggagagtgac cactgccccc cccataccag 15600 
aaggttcagg ccttcgtggc agggccctga 15660 
tgttctggtc aactttgcaa aaggttttct 15720 
tagaggctcc tgaaaagctt caggacccac 15780 
gtgggggtgg cccagatcat ccagggaggg 15840 
tgaaatgact tccatgtgtg cgggctcacg 15900 
ttctgcccac ttgaggggca ggggaggcag 15960 
tggggacccc agcaaaacta gaagctgtaa 1602 0 
cagggcccct gtggagggcg gggagacatg 1608 0 
cgccagggtt gtccatccag gacccagtgc 1614 0 
ggagtaaaga cagagcacag ggtgaggggg 16200 
cagagggagg ggggtagagg acatccccag 16260 
tcaccttcaa cacgcgtggg ctcagcctgg 1632 0 
atctgaggag ctatcaagga ggaggaaaag 1638 0 
gacacgggag tcccctgacc tttgtcgggg 1644 0 
gaggccaagg ccgccagtgc agaggggaag 16 500 
gatgggcagc agatgccttt agggtgagag 16 560 
ccatcatctg agaaaaacaa cagccagatg 1662 0 
tccctcgggg actctgtcct ggtggcaggg 16 68 0 
tcatctcttc aaggggaggg gacgcttccg 16740 
tgcctaaggg ggctctagga ggaatcccag 16800 
tgttcccctg gaatgctccc agcccctcca 16 86 0 
tgcaggagcc aggccccagg cacccagagc 1692 0 
gtcccagctc ccagaacaga cagcctcccc 16980 
gggctgatgg ctgcctgtgg aagtgactca 17 040 
tctcctccac cttctctccc agactacaaa 17100 
cttgattgga tgagtgaatt gccatcaggc 17160 
ggaaaggagg ctagaggggc cagtgcaggg 17220 
ggaggggagg tctctcagcc cagagagcac 1728 0 
gctcctaggc ctctaagggt ttggggaaga 17340 

tgggtgcagt gtggtgtgtg ggtgctgtcc 17400 

ggcaccctga ccccaatgag ggagagaggg 17460 
tgaccgggtg aatgggggtg gaacccgagg 17520 
ccctctccca ggggtcccat cacctcccct 17580 
gcagtggacc gcctggggcc ctcccctggg 17640 
tggtccccac acagtgctgt ggaagccacc 17700 
catgccccct ccctggttgc cctccctgtt 17760 
ctaatcctcc cacccgcttg cagaggttgt 17820 
ggggttaccc cgaggagaca gctgcagtct 17880 
atctggagag gcccttctct atctcaccaa 17940 
tgccctgctg ccctgggagg agggacgagg 18000 
tgcagcctga gagcaaagca gggactctgg 18060 
gatggagccc cgctgagtcc taaggggtgg 18120 
cggatggaag ctggttgtga gagcgagaat 18180 
ggggctgggg gctgggctgt ccccgagggg 18240 
gagtggcagc cgggcagtgt gggcaagtcc 18300 
cgtccctggg caggtcccac ggagaccccc 183 60 
ccgtatggtg ccggggccgt gagaccctcc 18420 



WO 01/78894 



PCT/US01/12245 



gcgcgctgac ccgagctctg agcagaaccc 
cccctcaggg cgcaccccgc ccgcgtcctc 
ccagagcagc agcagtagta gcagcagcag 
ccagcccata gctgtgagct cctcggcctc 
caggctggat tgcggccgac ccgtgcccgg 
cccgccccca cctcccaccg cccgccctcg 
cggagcctgg ccaggggccc cgacggcgcg 
gaccgccgcc cctcgagggg gtggagctgg 
atgaccggcc cgagccgtcc ctccccgtcg 
gagagaaccg ctgggaaggt tcttggagga 
aaggccgcct ctcgctgctg aagttggccc 
gccgatctct gtcccctcct tgctacccgt 
agtcgggtca tccacctgca ctgggtgccc 
gccgctgtct tgggggcttc attgctgggc 
tcttccctgg cggtggccag agggcagaat 
gagacccttt cggcctcagt aacatctgtt 
gtgcaaattt ggatgagatg ttaacttttt 
ctctgtcgcc aggctggagt gcagcggcgc 
tggttcaagc gattctcctg cctcagcctc 
ccacccccag ctaacttttg tatttttagc 
tggtctcaat ctcctgatct cgtgatccac 
caggcatgag ccaccgcgcc cggccggaga 
tttttttttt tgagacagag tttctctctt 
ctgggctcac tgcaacctct gcctcccaga 
agtagctggc attacaggca ttcgccacca 
atggggtttc tccatgttgg tcaggctggt 
gcctcggcct cccaaagcgc tggaattaca 
aagtaaatct atttgttttt gagaatttgg 
gcagtgagag aaacctacat tcacatatct 
aaagtctccg tgctcttagc cctcctcacc 
ataggctttt agattatttt ttcacactca 
taacgttttt ctgcactttg attttctttt 
tcagtgcgtt tagagggtcc ttgttgttct 
tctgtcaccc acgctagagg acagtggcct 
gggctcaagg tatcctccca cctcagcctc 
ccaggcccag ctaaaatttt taatttttct 
gctggtctca aactcctggt cttggccagg 
tttgggaggc cgaggcgggc agatcactgg 
catggtgaaa ccccatctct actaaaaata 
ctgtagttcc agctacttgg gaggctgagg 
ggttgcagcg agccgagatc atgccattgc 
cgtctcaaaa aataaaaata aaaataaaaa 
tcagcttctc aaatcgctgg aattacagga 
ttattactta ttattactac tttggatttt 
cacaataaaa tgcacttatt ttaagtggcc 
ctacataagc atcactataa tgcagacaca 
cttccagcca aacttcccca ctcccaaccc 
tatagataag ttttgcctgt tctagaattt 
ttctgtgtct ggcttccttc cctctttccg 
catatcaata gtttgttcct tcgtattgct 
agtttctcta ctcaccagtg caccataggg 
gaactatatt tgcacagaga gagagagagg 
caattggctc acgtgattat ggaggccaaa 
agaacgagga aagccagtgg tgtgattcag 
accagtatgg aggtggctcg agctcagaac 
ccagagcagc ccctcagcga cacctcttca 
cttcagtgtg gatgtcaggt acagaaggca 
actggtattc ttacctcgaa ctggccaact 
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atccccgcca ccaccaccgc gcctagcctg 18480 
accttgaagc accccggcgc ctggcactgg 18540 
caacggggtc ccccgagctc tccggggcct 18600 
taggcagcgg ctcgcaactc cggctccgcc 18660 
tgcagcctca ggccgccgcc ttcggacctt 18720 
ctcccgcctc ccctccccgc caaccccgct 18780 
cgccatgggg gagccgggtc gccactcccg 18840 
gcggaggagg gaatccgtgc ggcccctcgg 18900 
gtctcagagg gcctctactc ctgagaggag 18960 
ccgcggcgtg gtgggatgag gcggtgggca 19020 
caggagcgcg atcttccgtg gtctcctggg 19080 
cctgccccga gggtgccctg gcggaggttg 19140 
ccaaggatag gaaggttcag gcaaccggct 192 00 
aaaggcgatg cagcagacgg agacaacctt 19260 
tgcataaaag ctgcagactc ccaggcctgg 1932 0 
tcatgtttta aacttttgtt ttcctactcg 19380 
tttttttttt ttttttgaga tggagtctcc 19440 
gatcttggct cactgcaacc tccgactccc 19500 
ccgagtagct gggactacag gcgcgcgcta 19560 
agagacgagg tttcaccatt ttggccagga 19620 
ccgcctcggc ctctcaaagc gctgggatta 19680 
tgttaacttt taagcaaatc tttttttttt 19740 
gttacccaga ctggagtgca atggcatgat 19800 
ttcaagtgat tcttctgcct cagcctcccg 19860 
cgcctggcta attttgtatt tttagtagag 1992 0 
ctcgaactcc cgacctcagg tgatctgccc 19980 
ggcgtgagac accgcaccca gcctactttt 2 0040 
aatgtagtaa tttggttagt gaaagttcga 2 010 0 
caaaatcaaa aagtacagaa agcataggga 2 0160 
aacaggaaac caatatgatt agtttctttc 2 022 0 
agacaataca gacatatttt tttctcttat 20280 
tttttttggt cgcttaatac accttagata 20340 
tatgattatt atttagagac agggtctcac 2 0400 
gatcatgcct cattgcagcc ttgaaatcct 2 0460 
ctgagtagct ggaactacag gcacacggca 20 520 
gtagacaggg ggtctcactt tgtttcccag 2 0580 
cgcagtgtct catgcctgta atcccagcac 20640 
aggtcaggag ttcaagacca gtctggccaa 2 0700 
caaaaattag ccgggcatgg tggtgagcgc 20760 
caggaaaatc gcttgaactc agaaggtgga 2 0820 
actccagcct gggcaacaag agcgaaactc 20880 
gaactcctga tcttaagtga tcctcctgcc 2 0940 
gtgagtcacc acagctgtcc agctacgaga 21000 
caaatcaact tcattaaggt ataatttaca 21060 
agtaagatga gtttcgataa gtgtatataa 2112 0 
ttccctcact cacagaaaga gccctgtgcc 21180 
cagacagcca ctgatctgtt gttctctgtc 21240 
catataaatg gaatcatgcg gcatgcactc 21300 
atgtttttga gattcattta cactattttg 2136 0 
gaatagtgtt cggtggtttg agggaaccac 2142 0 
ttattttcca gttaggggct cttataattg 214 8 0 
aagaaagagg gagagagata tttattatag 2154 0 
aagttcccga atctgccatc tgcaagctgg 21600 
tttgagttca aaggcctgag aaccaggagc 21660 
aagttgggga caggaaagca gagcagcacc 21720 
gtaaagcaag gctgaacaca gaggggctgg 21780 
gctcgaggag ctactctggc gttcttgctt 21840 
cctacttaaa ctgcaggcca tggctttaat 2190 0 
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gtcctgtcat tcagaggctg tcccttaccc 
ttctccctgc aacacgtttc agaaggccct 
aagctcctga gctccatgtg gtctgggaat 
gtgctaataa ctgactgtag aggggacaca 
ccttacacag ttgacacctc ccaaacccag 
attttctgcc gaagtctctg agaaagggtt 
atgaggccat caaaccaaag ctgaggaaga 
gggcgcctga cagcacctgg gaaagatggc 
agaggggcat caggaaatgt cctgggaacc 
gtgcttggca tgggtgacac ccaccatctg 
gaacaggatg tctgagggag gcactccgac 
ctggtgacga caggctgatg cttgggtgcc 
cccaaagcca acgcttcatc cctgacagcc 
ttggcaaggt gggtggaatt tcaggttcat 
ttgtcaaaca gttttccaaa gtcattgtac 
aaaaacaggc aaatcaaatc tatggtgttc 
gctgcaactg gtacagagtg taagggggga 
gtgttcgctt ttggaaaagt ccttgagttg 
tacactttaa ttgaatgtac aaaaagtctc 
aactaacccc tctgagagcc tccaatccaa 
gaagcaggtg aagactccta agccggatat 
gcagctgctt atggtcctcc ctacaccctc 
tgcatttgac ttgctctctc cctgtcccac 
cccactgtca gcaatgtaga caagacaaac 
catgcacggg gctggtcatt gtgggtggca 
cagaaaggga cacaaatcac ttcagcgtaa 
cagcccctgg ccccctccta cagatggtgt 
ctctgttcta gcccagctgt ttgggttgca 



aaagccaggt tagcatcccc tgactgacac 21960 
gtagtcgtcc acttccctgt ctctctcccc 22020 
atgtgtgttg ctcacttcct agcacagtca 22080 
gtcgaaaagc cacatgggga tcagagtcat 2214 0 
atgagctgtg tccaagtgca ggtcagagga 22200 
tatttacatt ttgaggttgc aggggaggag 22260 
gggatcctag gatgcaccga gcagctccgg 22320 
ttctccactg gcttgttggc gtcaccctcc 22380 
aggcaaacca gtgagcatta acccttagaa 2244 0 
taaacacgac ttctcccaag gagtgacgca 22500 
tccagccttc agagatcgcc agggtggcac 22560 
ccagaaaagg tcatgtgtgt gaatgggggc 22620 
tggtgcattt agaggggaac tttfctgtccc 22680 
agggcaaggg tattttagct ttaatagata 22740 
acactctgtg attctactta tgtaaagttt 22800 
gaagtcaaga cagtagttac ccttgtgggg 22860 
ctgtaggatg gtctatttct tgatctgggt 2 292 0 
catttataat gtgtgaactt ttctgtatgt 22980 
aggaggcctc agaccactgg aagcggacac 2 3040 
gatggacata tgtccccttg gaagtatgca 23100 
tcccaaatcc ccccagtagc cgcagcttca 23160 
tcttccccag acagccccca aacatctggc 23220 
ctctggattt agtccatgtt ctccaccctc 23280 
gcttagttca cgtgcccacc tactgcgtgc 23340 
aatgtgagca acacacgaag cctcaaggag 23400 
ggtaatttgt gataaatgtc atgtaacttg 2 3460 
ctaagaataa accccactaa catgtgactc 2352 0 
agaaagagac tcactccagt tgcg 23574 



<210> 7 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 7 

agtctccgtg ctcttagccc tcctcaccaa caggaaacca atatgattag tttctttcat 60 
aggct 65 



<210> 8 
<211> 656 
<212> DNA 

<213> Homo sapiens 
<400> 8 

gtgcagcctc aggccgccgc cttcggacct 
gctcccgcct cccctccccg ccaaccccgc 
gcgccatggg ggagccgggt cgccactccc 
ggcggaggag ggaatccgtg cggcccctcg 
ggtctcagag ggcctctact cctgagagga 
accgcggcgt ggtgggatga ggcggtgggc 
ccaggagcgc gatcttccgt ggtctcctgg 
tcctgccccg agggtgccct ggcggaggtt 
cccaaggata ggaaggttca ggcaaccggc 
caaaggcgat gcagcagacg gagacaacct 
ttgcataaaa gctgcagact cccaggcctg 



tcccgccccc acctcccacc gcccgccctc 60 
tcggagcctg gccaggggcc ccgacggcgc 120 
ggaccgccgc ccctcgaggg ggtggagctg 180 
gatgaccggc ccgagccgtc cctccccgtc 24 0 
ggagagaacc gctgggaagg ttcttggagg 300 
aaaggccgcc tctcgctgct gaagttggcc 360 
ggccgatctc tgtcccctcc ttgctacccg 42 0 
gagtcgggtc atccacctgc actgggtgcc 480 
tgccgctgtc ttgggggctt cattgctggg 54 0 
ttcttccctg gcggtggcca gagggcagaa 6 00 
ggagaccctt tcggcctcag taacat 656 
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<210> 9 

<211> 177 

<212> DNA 

<2i3> Homo sapiens 

<400> 9 

cgggcacggg tcggccgcaa tccagcctgg gcggagccgg agttgcgagc cgctgcctag 60 
aggccgagga gctcacagct atgggctgga ggccccggag agctcggggg accccgttgc 12 0 
tgctgctgct actactgctg ctgctctggc cagtgccagg cgccggggtg cttcaag 177 



<210> 10 

<211> 80 

<212> DNA 

<213> Homo sapiens 

<400> 10 

gacatatccc tgggcagcca gtcaccccgc actgggtcct ggatggacaa ccctggcgca SO 
ccgtcagcct ggaggagccg 80 

<210> 11 

<211> 77 

<212> DNA 

<213> Homo sapiens 



<210> 12 
<211> 79 
<212> DNA 

<213> Homo sapiens 
<400> 12 

caggctgctg gccccaggat acatagaaac ccactacggc ccagatgggc agccagtggt 60 
gctggccccc aaccacacg 79 



<210> 13 
<211> 119 
<212> DNA 

<213> Homo sapiens 
<400> 13 

caggctgctg gccccaggat acatagaaac ccactacggc ccagatgggc agccagtggt 60 
gctggccccc aaccacacgg tgagatgctt ccatgggctc tgggatgcac cgccagagg 119 

<210> 14 
<211> 77 
<212> DNA 

<213> Homo sapiens 



<400> 11 

gtctcgaagc 

gagctggaga 



cagacatggg gctggtggcc ctggaggctg aaggccagga gctcctgctt 60 
agaacca 77 



<400> 14 
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gatcattgcc actaccaagg gcgagtaagg ggcttccccg actcctgggt agtcctctgc 60 
acctgctctg ggatgag 77 



<210> 15 

<211> 190 

<212> DNA 

<213> Homo sapiens 



<400> 15 

tggcctgatc accctcagca ggaatgccag 
ctccaaggac ttctcaaccc acgagatctt 
aacctgtggc cacagggatc ctgggaacaa 
ccagagcagg 



ctattatctg cgtccctggc caccccgggg 60 
tcggatggag cage tgc tea cctggaaagg 120 
agegggcatg accagccttc ctggtggtcc 180 

190 



<210> 16 
<211> 66 
<212> DNA 
<213> Homo 



sapiens 



<400> 16 

ggcaggegag aagcgcgcag gaeceggaag tacctggaac tgtacattgt ggcagaccac 60 
accctg 66 



<210> 17 

<211> 72 

<212> DNA 

<213> Homo sapiens 



<400> 17 

ttcttgactc ggcaccgaaa cttgaaccac accaaacagc gtctcctgga agtcgccaac 60 
tacgtggacc ag 72 



<210> 18 

<211> 167 

<212> DNA 

<213> Homo sapiens 



<400> 18 

cttctcagga ctctggacat tcaggtggcg 
gaccgcagcc gcgtcacgca ggacgccaac 
egggggctgt gggcgcagcg gccccacgac 



ctgaccggcc tggaggtgtg gaecgagegg 60 
gccacgctct gggccttcct gcagtggcgc 12 0 
tccgcgcagc tgctcac 167 



<210> 19 
<211> 85 
<212> DNA 

<213> Homo sapiens 
<400> 19 

gggccgcgcc ttccagggcg ccacagtggg cctggcgccc gtcgagggca tgtgccgcgc 60 
cgagagctcg ggaggcgtga gcacg 85 



<210> 20 
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<211> 143 
<212> DNA 
<213> Homo sapiens 



<400> 20 

gaccactcgg agctccccat cggcgccgca gccaccatgg cccatgagat cggccacagc 60 
ctcggcctca gccacgaccc cgacggctgc tgcgtggagg ctgcggccga gtccggaggc 120 
tgcgtcatgg ctgcggccac egg 143 



<210> 21 

<211> 178 

<212> DNA 

<213> Homo sapiens 



<400> 21 

gcacccgttt ccgcgcgtgt tcagcgcctg cagccgccgc cagctgcgcg ccttcttccg 60 
caaggggggc ggcgcttgcc tctccaatgc cccggacccc ggactcccgg tgccgccggc 120 
getctgeggg aaeggctteg tggaagcggg cgaggagtgt gactgcggcc ctggccag 178 



<210> 22 

<211> 90 

<212> DNA 

<213> Homo sapiens 



<400> 22 

gagtgccgcg acctctgctg ctttgctcac aactgctcgc tgcgcccggg ggcccagtgc 60 
gcccacgggg actgetgegt .gcgctgcctg 90 



<210> 23 

<211> 196 

<212> DNA 

<213> Homo sapiens 

<400> 23 

ctgaagcegg ctggagcgct gtgccgccag gccatgggtg actgtgacct ccctgagttt 60 
tgeaegggea cctcctccca ctgtccccca gaegtttace tactggaegg ctcaccctgt 12 0 
gecaggggea gtggctactg ctgggatggc gcatgtccca cgctggagca gcagtgccag 180 
cagctctggg ggcctg 196 



<210> 24 

<211> 107 

<212> DNA 

<213> Homo sapiens 



<400> 24 

gctcccaccc agctcccgag gcctgtttcc aggtggtgaa etctgeggga gatgetcatg 60 
gaaactgegg ccaggacagc gagggecact tcctgccctg tgcaggg 107 



<210> 25 

<211> 199 

<212> DNA 

<213> Homo sapiens 
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<400> 25 

ggatgccctg tgtgggaagc tgcagtgcca 
catggtgcca gtggactcta ccgttcacct 
cttggcactc cccagtgccc agctggacct 
ccagtgtgga cctagaatg 



gggtggaaag cccagcctgc tcgcaccgca 60 
agatggccag gaagtgactt gtcggggagc 12 0 
gcttggcctg ggcctggtag agccaggcac ISO 

193 



<210> 26 
<211> 109 
<212> DNA 

<213> Homo sapiens 



<400> 26 

gtttgcaata gcaaccataa ctgccactgt gctccaggct gggctccacc cttctgtgac 60 
aagccaggct ttggtggcag catggacagt ggccctgtgc aggctgaaa 109 



<210> 27 

<211> 148 

<212> DNA 

<213> Homo sapiens 



<400> 27 

accatgacac cttcctgctg gccatgctcc tcagcgtcct gctgcctctg ctcccagggg 60 
ccggcctggc ctggtgttgc taccgactcc caggagccca tctgcagcga tgcagctggg 12 0 
gctgcagaag ggaccctgcg tgcagtgg 148 



<210> 28 

<211> 92 

<212> DNA 

<213> Homo sapiens 



<400> 28 

ccccaaagat ggcccacaca gggaccaccc cctgggcggc gttcacccca tggagttggg 60 
ccccacagcc actggacagc cctggcccct gg 92 



<210> 29 

<211> 72 

<212> DNA 

<213> Homo sapiens 



<400> 29 

accctgagaa ctctcatgag cccagcagcc accctgagaa gcctctgcca gcagtctcgc 60 
ctgaccccca ag 72 



<210> 30 

<211> 1031 

<212> DNA 

<213> Homo sapiens 



<400> 30 

cagatcaagt ccagatgcca agatcctgcc 
gatttaaaga caggtggcca ctgacagcca 
cagtgaatca ccggacctcc agcacctgca 
gcttcgaccc acccactcca ggaacccaga 



tctggtgaga ggtagctcct aaaatgaaca 60 
ctccaggaac ttgaactgca ggggcagagc 12 0 
ggcagcttgg aagtttcttc cccgagtgga 180 
gccacattag aagttcctga gggctggaga 24 0 
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acactgctgg gcacactctc 
agcccctgac ctccctcacc 
gtcctgggag tctggtgtgt 
catgactgct ggccagaagc 
cccctgcagc ctggctggcc 
tcttcccacc ctgtcttctc 
tacacctgag gttctggagc 
ggggacagag ggaaccattt 
tggctatagg cgtggtggct 
ggatcaccag aggccagcag 
acagaaaaat tttaaaatta 
ggaggctgaa gcaggaggat 
caccactgca ctccagcctg 
aagacataaa a 



<210> 31 
<211> 78 
<212> DNA 

<213> Homo sapiens 
<400> 31 

gtgtgccaga gcaggcgctg 
gcctgccaca gccacggg 



<210> 32 
<211> 6 
<212> PRT 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: polyhistidine tag 
<400> 32 

His His His His His His 
1 5 



<210> 33 
<211> 8 
<212> PRT 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: FLAG epitope tag 
<400> 33 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 34 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



cagctcaata aaccatcagt cccagaagca aaggtcacac 300 
agtggaggct gggtagtgct ggccatccca aaagggctct 360 
ctcctacatg caatttccac ggacccagct ctgtggaggg 420 
tagtggtcct ggggccctat ggttcgactg agtccacact 480 
tctgcaaaca aacataattt tggggacctt ccttcctgtt 54 0 
ccctaggtgg ttcctgagcc cccaccccca atcccagtgc 600 
tcagaatctg acagcctctc ccccattctg tgtgtgtcgg 660 
aagaaaagat accaaagtag aagtcaaaag aaagacatgt 720 
catgcctata atcccagcac tttgggaagc cggggtagga 780 
gtccacacca gcctgggcaa cacagcaaga caccgcatct 84 0 
gctgggcgtg gtggtgtgta cctgtaggcc tagctgctca 900 
cacttgagcc tgagttcaac actgcagtga gctatggtgg 960 
ggtgacagag caagaccctg tctctaaaat aaattttaaa 1020 

1031 



caggaagaat gccttccagg agcttcagcg ctgcctgact 60 

78 



<220> 
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<223> Description of Artificial Sequence: Primer 



<400> 34 

aactcttgaa atgagaagcg tg 



22 



<210> 35 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 35 

aatatcatgc accatgaccc ac 22 



<210> 36 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 37 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 37 

ggagcttatc ctggattatc ta 22 



<210> 38 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 36 

tggagtaagt attgtaaact at 



22 



<400> 3B 

agagccacac atccatgtcc tg 



22 



<210> 39 
<211> 22 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 



<400> 39 

aagccactct gtgaattgcc at 



22 



<210> 40 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 40 

gagtagtcgt agtaccagat gg 22 



<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Primer 



<210> 42 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 42 

attagagcac atgaaggaaa gg 22 



<210> 43 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 41 

gtctggcaat ggagcatgaa aa 



22 



<400> 43 

acactgcttt gggggacagg ct 



22 



<210> 44 
<211> 22 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 44 

cacgacgcca cagagccagc tc 



<210> 45 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 45 

aaccaccacg gattcacgct tc 



<210> 46 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 46 

ataaccagat ggctgtgggt ca 



<210> 47 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 47 

atccccgcaa tgaaatagtt ta 



<210> 48 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 48 

gttgagagcc cacttagata at 
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<210> 49 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 49 

gcattggggg aagccaggac at 



<210> 50 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 50 

gccactagga ggcaatggca at 22 



<210> 51 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 51 

cgacggcatc acggccatct gg 22 



<210> 52 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 52 

tccaggctca ttcattttca tg 22 



<210> 53 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 



<400> 53 

tgacatcaac ttctcctttc ct 



22 
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<210> 54 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 54 

agttgcagag acctagcctg tc 



<210> 55 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 55 

tctgggagag gacggagctg gc 



<210> 56 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 56 

tgtaggacta tattgctc 



<210> 57 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 57 

cgacatttag gtgacact 



<210> 58 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: BstXI-linker 
adapter 
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<400> 58 

gtctt caeca egggg 



<210> 59 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: BstXI-linker 
adapter 

<400> 59 

gtggtgaaga c 11 



<210> 60 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Synthetic 
peptide 

<400> 60 

Asp Pro Gin Ala Asp Gin Val Gin Met 
1 5 



<210> 61 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Synthetic 
peptide 

<400> 61 

Asp Pro Gin Asp Gin Val Gin Met 
1 5 



<210> 62 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> M0D_RES 
<222> (1) . . (11) 

<223> "Xaa" represents a variable amino acid 



<22Q> 

<223> Description of Artificial Sequence: Zn-binding 
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consensus sequence 



<400> 62 

His Glu Xaa Xaa His Xaa Xaa Gly Xaa Xaa His 
1 5 10 



<210> 63 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Primer 
<400> 63 

ctgcctagag gccgagga 18 



<210> 64 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 65 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 65 

ttgcctgaac cttcctatcc 20 



<210> 66 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 64 

caggagacca cggaagatcg 



20 



<400> 66 

cccctgtgtt cctcaggtc 



19 



<210> 67 
<211> 20 
<212> DNA 



WO 01/78894 



PCT/US01/12245 



86/154 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 67 

gctccacact ctttcttgcc 



20 



<210> 68 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 68 

aggcaggagg aagctgaat 19 

<210> 69 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 70 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 70 

cctacccctc tgcacccta 19 



<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 69 

cctaccacac cctccctctt 



20 



«c400> 71 

aacttccttc tgggagctgg 



20 



<210> 72 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 72 

cacaccctgg tgaggagaga 



<210> 73 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 73 

ccacgaagga ccaccg 



<210> 74 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 74 

ctcacgtggg tgcctctg 



<210> 75 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 75 

ctctacggcc gcagtgac 



<210> 7S 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 75 

gtccctccat gcccaatg 
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<210> 77 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 77 

caggttaagt cggctcgc 18 



<210> 78 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 79 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 79 

tctactgtgg ggaagatggg 2 0 



<210> 80 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 81 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 78 

ctctctctgc cttccccac 



19 



<400> 80 

cccctctact tcctcccca 



19 



<400> 81 
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gaccttgggg ttcctaatcc 



<210> 82 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 82 

gtgcacctgc tcaggactc 



<210> 83 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 83 

cctggactct tatcacgttg c 



<210> 84 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 84 

ttaccctcca ccatttctcc 



<210> 85 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 85 

gtggagaggg aagggagaag 

<210> 86 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 86 

ccccatgggt tgaatttaca 



20 



<210> 87 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 87 

gcagctaggc ctacaggtac a 21 



<210> 88 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 89 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 89 

aggtgtagca ctgggattgg 2 0 

<210> 90 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 88 

accacgccta tagccaacat 



20 



<400> 90 

ccccaggacc actagcttct 



20 



<210> 91 
<211> 20 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 91 

attgagctgg agagtgtgcc 

<210> 92 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 92 

ttcaagttcc tggagtggct 



<210> 93 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 93 

acaaggaccc tctaaacgca 



<210> 94 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 94 

acccttctgt gacaagccag 

<210> 95 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 95 

gtgttgctac cgactcccag 



<210> 96 
<211> 18 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 96 

cccaggtgca gagagcag 18 



<210> 97 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 97 

gctcctcttg tccactctcc t 21 



<210> 98 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 98 

gccacttcct ctgcacaaat 20 

<210> 99 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 99 

ttctctgtga cctgggtggt 2 0 



<210> 100 

<211> 18 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 100 

atttgggcca gagatggg 18 



<210> 101 
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<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Primer 
<400> 101 

ggcagaggag caaggtgg 18 



<210> 102 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 102 

atggcttgga atcatcaagg 2 0 



<210> 103 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 103 

tagagagagg aggtgccagc 2 0 



<210> 104 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 104 

aaagatggcc cacacagg 18 



<210> 105 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 105 

agaactctca tgagcccagc 20 
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<210> 106 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 106 

agctctgagc agaacccatc 



<210> 107 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 107 

ctcgaggggg tggagctg 



<210> 108 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 108 

gagaggagga gagaaccgct 



<210> 109 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Primer 
<400> 109 

agtgacttgg tggttctggg 



<210> 110 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 110 
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tgtcatctgc accctctctg 



<210> 111 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 111 

aagagggagg gtgtggtagg 



<210> 112 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 112 

gtgatcaggc cactagggtg 



<210> 113 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 113 

atacagcatt cccactccca 



<210> 114 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 114 

gaaggcagaa atcccggt 



<210> 115 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 115 

caccagcacc tgcctgtc 



<210> 116 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 116 

gggtcagagg cacccac 



<210> 117 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 117 

gccgtagagc ctcctgtct 



<210> 118 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 118 

gacgaccaaa gaaacgcag 



<210> 119 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 119 

tgagcggaga gggcaagt 



<210> 120 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 120 

aaaccctcac cctgaacctt 2 0 

<210> 121 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 121 

aagggtgctc gtgtcctct 19 



<210> 122 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 122 

ccactcagct ccactcccta 20 



<210> 123 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 123 

ggattcaaac ggcaaggag 



<210> 124 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 124 

gctgagtcct gagcaggtg 



<210> 125 
<211> 19 
<212> DNA 



WO 01/78894 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 125 

gaaccgcagg agtaggctc 

<210> 126 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 126 

atatggtcag caggagaccc 



<210> 127 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 127 

gcatcctggt ctccatgata a 



<210> 128 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 128 

gaggctttga atccaggtcc 



<210> 129 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 129 

cagcaagaca ccgcatctac 



<210> 130 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 130 

gggacagagg gaaccattta 20 



<210> 131 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<22 3> Description of Artificial Sequence: Primer 
<400> 131 

ttccttcctg tttcttccca 20 



<210> 132 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 132 

gtcctgggag tctggtgtgt 20 



<210> 133 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 133 

aggaacccag agccacacta 



<210> 134 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 134 

tgcctctggt gagaggtagc 



20 
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<210> 135 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 135 

ttcctggatc actggtcctc 2 0 

<210> 136 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 136 

ttcgagcagt gagagaaacc t 21 



<210> 137 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 137 

ctgggagtcg gtagcaaca 19 



<210> 138 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 138 

aggccactgg aacctcct 



<210> 139 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 139 
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gcagcatggt acagggactg 20 

<210> 140 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 140 

cagctgacca gtggtatgga 2 0 

<210> 141 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 141 

tgtcagacat ggccacagag 20 

<210> 142 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 142 

agggtcctct tagctgccac 2 0 

<210> 143 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 143 

aggccttgtc atttcctgtg 20 

<210> 144 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



WO 01/78894 PCT/US01/12245 

102/154 

<400> 144 

caaagaacct tggatgtccg 20 

<210> 145 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 145 

ctcagctccc ttcctgctc 19 

<210> 146 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 146 

ctgtgtgggc catctttg 18 

<210> 147 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 147 

ggagaaatgg tggagggtaa 2 0 

<210> 148 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 148 

aaagccacag cttctccct 19 

<210> 149 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 149 

aggtttctgg gctcaggtta 



<210> 150 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 150 

gtaggtgtgc cagagcagg 



<210> 151 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 151 

tgtggaccta gaatggtgag c 



<21Q> 152 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223=> Description of Artificial Sequence: Primer 
<400> 152 

caaagtcaca caacaagcgg 



<210> 153 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 153 

caggatcttg gcatctggac 



<210> 154 
<211> 20 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 154 

ctggcttgtc acagaagggt 

<210> 155 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 155 

ctggagcaca gtggcagtta 



<210> 156 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 156 

tttggtcgtc cctcagtttc 



<210> 157 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 157 

cctctcagga gtagaggccc 



<210> 158 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 158 

agcggttctc tcctcctctc 



<210> 159 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 159 

cctctcagga gtagaggccc 2 0 

<210> 160 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 160 

atgttactga ggccgaaagg 20 

<210> 161 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 161 

ccctttccag ccttctcttt 2 0 

<210> 162 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Primer 
<400> 162 

caggactgca aacatcctga 20 

<210> 163 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 163 

tccctggtgc ttcccata 18 
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<210> 164 
<211> 19 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 164 

aggcaggagg aagctgaat 

<210> 165 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 165 

cctcttgccc ctcttgct 



<210> 166 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 166 

cctgaatgtc cagagtcctg a 



<210> 167 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 167 

ggcctcgagt cccagtattt 



<210> 168 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 168 
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agagcctcct gtctctccct 



<210> 169 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 169 

tcgccctcag cttctcag 



<210> 170 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Description of Artificial Sequence: Primer 
<400> 170 

tcacgtgggt gcctctga 

<210> 171 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 171 

gggttacttc ccctctctgg 



<210> 172 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 172 

ctgggctttc caccctgg 



<210> 173 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 173 

ctgggctttc caccctgg 



<210> 174 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 174 

tccaggtggt gaactctgc 



<210> 175 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 175 

tagaatggtg agctctgccc 



<210> 176 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 176 

gaccttgggg ttcctaatcc 



<210> 177 
<211> 19 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence; Primer 
<400> 177 

ccaagcacac ttgagcgtc 



<210> 178 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 178 

agccatgccc tctgcttt 



<210> 179 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 179 

cagcccaagc acacttga 



<210> 180 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 180 

cccatagctg tgagctcctc 



<210> 181 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 181 

aaagcttcag gacccacaaa 



<210> 182 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 182 

atcttggtcc ctgccattc 



<210> 183 
<211> 18 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 183 

gagggagctc tttcccca 



<210> 184 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 184 

ggaccaccag gaaggctg 



<210> 185 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 185 

aaccccagct cccagaag 



<210> 186 
<211> 20 
c212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 186 

ctgctcacct ggaaaggaac 



<210> 187 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 187 

actgcaggaa ggcccagag 



<210> 188 
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<211> 20 
<:212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 188 

accgaaactt gaaccacacc 



<210> 189 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 189 

tgagggacga ccaaagaaac 



<210> 190 
<2ll> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 190 

caaagtcaca caacaagcgg 



<210> 191 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 191 

gaacctgagg gcaccaatta 



<210> 192 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 192 

ttggccttag ttaattggtg c 
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<210> 193 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 193 

ttggccttag ttaattggtg c 21 

<210> 194 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 194 

ctggagcaca gtggcagtta 2 0 



<210> 195 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 195 

aggagtaggc tcaggaagca 



<210> 196 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 196 

tgtactggga ggtagagggc 



<210> 197 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 



<400> 197 



WO 01/78894 



113/154 



agagggtgac ttggagcaga 



<210> 198 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Sequence: Primer 
<400> 198 

aggcaataac ccactcagga 

<210> 199 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 199 

cccatgggtt gaatttacat a 



<210> 200 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 200 

gcctctggtg atcctcctac 



<210> 201 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 201 

actcagtcga accatagggc 



<210> 202 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 202 

tgtgtgacct ttgcttctgg 



<210> 203 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 203 

gcatgaagca atgggagaat 



<210> 204 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequencer Primer 
<400> 204 

actcagtcga accatagggc 



<210> 205 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 205 

gcaggaaggt gtcatggtct 



<210> 206 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence; Primer 
<400> 206 

gcaggaaggt gtcatggtct 



<210> 207 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 207 

gggcattgga gaggcaag 18 



<210> 208 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 208 

tctgcctccc agattcaagt 



<210> 209 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 209 

agaatgcctt ccaggagctt 20 



<210> 210 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Sequence: Primer 
<400> 210 

gtgttgctac cgactcccag 



<210> 211 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 211 

ctgcttcctg agcctactcc 



<210> 212 
<211> 19 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 212 

aacaggaggt tccagtggc 



<210> 213 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 213 

agcgagttgt gattgagggt 



<210> 214 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 214 

tgtgcaggct gaaagtatgc 



<210> 215 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 215 

gccacttcct ctgcacaaat 



<210> 216 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 216 

ctgagcccag aaacctgatt 



<210> 217 
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<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 217 

gtgagtgagg caccaggg 



<210> 218 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 218 

cctagatggc caggaagtga 



<210> 219 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 219 

ccagaaacct gattaggggg 



<210> 220 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 220 

tacctctcac cagaggcagg 



<210> 221 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 221 

gccagaagct agtggtcctg 
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<210> 222 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 222 

gcaggcagct tggaagttt 19 

<210> 223 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 223 

ttatcatgga gaccaggatg c 21 

<210> 224 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 224 

gacctggatt caaagcctcc 20 

<210> 225 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 225 

atgttggcta taggcgtggt 20 

<210> 226 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 226 
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ttatcatgga gaccaggatg c ,21 



<210> 227 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 227 

ctgagtggag ggagcagaag 20 



<210> 228 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 228 

ctgagtggag ggagcagaag 2 0 



<210> 229 
<2ll> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 229 

ccatgagatc ggccacag 18 



<210> 230 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 230 

atttcaaggc tgcaatgagg 



<210> 231 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 231 

acttctttcc atggcctctg 2 0 

<210> 232 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 232 

accacccagg tcacagagaa 20 

<210> 233 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 233 

tcccaagacc aggctatgtc 20 

<210> 234 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 234 

ctggggatga gaagcagc 18 

<210> 235 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 235 

cttctccctt ccctctccac 20 



<210> 236 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 236 

atttgtgcag aggaagtggc 



<210> 237 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 237 

catttcctcc aggctctgac 



<210> 238 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Primer 
<400> 23 8 

tcagagcctg gaggaaatgt 

<210> 239 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Primer 
<400> 239 

gttcctggag tgggtgggt 



<210> 240 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 240 

ctgggagtcg gtagcaaca 



<210> 241 
<211> 41 
<212> DNA 
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<213> Homo sapiens 
<400> 241 

gccctctgag accgacgggg agggacggct cgggccggtc a 



<210> 242 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 242 

caagaacctt cccagcggtt ctctcctcct ctcaggagta g 



<210> 243 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 243 

caccatctca gctccacact ctttcttgcc caggtctcga 



<210> 244 

<211> 41 

<212> DNA 

<213> Homo sapiens 



<400> 244 

ccaccatctc agctccacac tctttcttgc ccaggtctcg a 



<210> 245 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 245 

acaactaagc catcaccaag gctccttcct ctagccccaa g 



<210> 246 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 246 

tggtgcttcc catattcaca tctcccacaa ctaagccatc 



<210> 247 

<211> 41 

<212> DNA 

<213> Homo sapiens 



<400> 247 

caggatacat agaaacccac tacggcccag atgggcagcc a 
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<210> 248 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<40Q> 248 

ccctccaaat cagaagagac aggaattcac aggcctcgag t 



<210> 249 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 249 

agctgctcac ctggaaagga acctgtggcc acagggatcc t 41 



<210> 250 
<211> 41 
<212> DNA 

<213> Homo sapiens 
<400> 250 

acttccttct gggagctggg gttgggggtc agggctcaag c 41 

<210> 251 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 251 

ttcctgcagt ggcgccgggg gctgtgggcg cagcggcccc a 41 



<210> 252 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 252 

ggttcagggt gagggtttcg gggagcttgg gagccggcct g 41 



<210> 253 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 253 

cagagaagcg cgggggttgg gggactgtcc ctccatgccc 



<210> 254 
<211> 41 
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<212> DNA 

<213> Homo 



sapiens 



<400> 254 
cccctctctg 



ggctctgcgc gtctggcggc tgtagccaag c 



41 



<210> 255 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 255 

cagccgccgc cagctgcgcg ccttcttccg caaggggggc g 41 



<210> 256 
<211> 41 
<212> DNA 

<213> Homo sapiens 
<400> 256 

agtggcctcc cagtcaagcg agggggtgga tccctgcccc a 41 



<210> 257 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 257 

tgctggccat gctcctcagc gtcctgctgc ctctgctccc a 41 



<210> 258 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 258 

ctgctgcctc tgctcccagg ggccggcctg gcctggtgtt g 41 



<210> 259 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 253 

gaagtagctt tgaacaggag gttccagtgg cctcccagtc a 41 



<210> 260 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 260 
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gcctctgtct caccagtttt cggccctttg ccacttcctc t 



<210> 261 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 261 

acaaatcacc tctgtcaccc ccttgaagtt cccaaatgct g 



<210> 262 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 262 

tccataccac tggtcagctg cggtgctggc tgcccctgtg 



<210> 263 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 263 

ggtgctggct gcccctgtgc cagggccctg ccttaaccca g 



<210> 264 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 264 

ggaaatgaca aggccttggg ggatgggatg gggacagtca 



<210> 265 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 265 

agggctcatg cctcctgcct ccttccagat gggcagcacc c 



<210> 266 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 266 

gcccctcccc agccccaggg tctcctgctg accatattca c 



<210> 267 
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<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 267 

cctgggcggc gttcacccca tggagttggg ccccacagcc a 



<210> 268 

<211> 41 

<212> DNA 

<213> Homo sapiens 



<400> 268 

gccccacagc cactggacag ccctggcccc tgggtgagtg a 



<210> 269 

<211> 41 

<212> DNA 

<213> Homo sapiens 



<400> 269 

gccctggccc ctgggtgagt gaggcaccag ggggaggtgg a 



<210> 270 

<211> 41 

<212> DNA 

<213> Homo sapiens 



<400> 270 

tgcagcctgg ggccccagtc cttaggggac aacatatcct c 



<210> 271 

<211> 41 

<212> DNA 

<213> Homo sapiens 



<400> 271 

cactgagtga ggatgggctc tctgccacac agcttgcagc c 



<210> 272 

<211> 41 

<212> DNA 

<213> Homo sapiens 



<400> 272 

ctggtcctca ctgagtgagg atgggctctc tgccacacag c 



<210> 273 

<211> 41 

<212> DNA 

<213> Homo sapiens 
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<400> 273 

atgacctctt ggttatcatg gagaccagga tgctggaagc 



<210> 274 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 274 

agcaagacac cgcatctaca gaaaaatttt aaaattagct g 41 



<210> 275 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 275 

ggaggatcac cagaggccag caggtccaca ccagcctggg c 41 

<210> 276 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 276 

atcccagcac tttgggaagc cggggtagga ggatcaccag a 41 



<210> 277 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 277 

agcctggctg gcctctgcaa acaaacataa ttttggggac c 41 



<210> 278 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 278 

actgagtcca cactcccctg cagcctggct ggcctctgca a 41 

<210> 279 

<211> 41 

<212> DNA 

<213> Homo sapiens 



<400> 279 

tccaggaacc cagagccaca ttagaagttc ctgagggctg g 



41 
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<210> 280 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 280 

ttcttccccg agtggagctt cgacccaccc actccaggaa c 41 



<210> 281 

<211=» 41 

<212> DNA 

<213> Homo sapiens 

<400> 281 

tcctcattct cagcagatca agtccagatg ccaagatcct g 



<210> 282 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 282 

ctgaggacca cacggggtgg tggttggcgg ggtggtggtt g 



<210> 283 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 283 

ggctggcagg ccgagcctag atggcagcca gagccccagg c 41 



<210> 284 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 284 

ctttgctctg tcactcctgc ctcccttggg cgttcacatt c 41 



<210> 285 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 285 

gtgagctctg cccacccgac ccctccttgc cgtttgaatc c 41 



<210> 286 

<211> 41 

<212> DNA 

<213> Homo sapiens 
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<400> 286 

tggcgaggtt actcctacac cgggaggagc accgtcgggt c 



<210> 287 

<211> 41 

<212> DNA 

<213> Homo sapiens 



<400> 287 

ggctgctcac tattggggcc gcatcgtccc ctgtcccgct t 



<210> 288 

<211> 41 

<212> DNA 

<213> Homo sapiens 

<400> 288 

gccgcatcgt cccctgtccc gcttgttgtg tgactttgcg c 



<210> 289 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 289 

gccgtcccac cccgtcg 



<210> 290 
<211> 17 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 290 

cctcctctct tggcgac 



<210> 291 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 291 

tccacactct ttcttgcc 
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<210> 292 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 292 

gctccacact ctttcttgcc 



<210> 293 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Primer 
<400> 293 

tcaccaaggc tccttcct 



<210> 294 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 294 

cagaagagac aggaattcac a 



<210> 295 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 295 

tggaaaggaa cctgtggcc 

<210> 296 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 296 
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gggtttcggg gagcttg 



<210> 297 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 297 
gggttggggg actgtc 



<210> 298 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 298 
ctctgcgcgt ctggcg 



<210> 299 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 299 

gccgtccctc cccgtcg 



<210> 300 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 300 

tcctcctcta ttggcgaccc 



<210> 301 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 301 

ctccacactt tttcttgccc 



<210> 302 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 302 

gctccacact ctttcttgc 



<210> 303 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 303 

tcaccaagcc tccttcct 



<210> 304 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 304 

agaagagacg ggaattcac 



<210> 305 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 305 

tggaaaggag cctgtgg 



<210> 306 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 306 

agggtttcgt ggagcttgg 



<210> 307 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 307 

ggggttggag gactgtcc 



<210> 308 
<211> 18 
<212> DNA 

<213> Artificial "Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 308 

gctctgcgca tctggcgg 



<210> 309 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 309 

agtcaagcga gggggtgg 

<210> 310 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 310 
cctcagcgtc ctgctg 



<210> 311 
<211> 18 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 311 

aacaggaggt tccagtgg 



<210> 312 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 312 

accagttttc ggcccttt 



<210> 313 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 313 

ctgtcacccc cttgaagt 



<210> 314 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 314 
tcagctgcgg tgctgg 



<210> 315 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 315 
gccttggggg atgga 



<210> 316 
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<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence i Primer 
<400> 316 

tcctgcctcc ttccag 16 



<210> 317 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 317 

actggacagc cctggc 16 



<210> 318 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 318 

ctgtgtggca gagagccca 



<210> 319 - 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 319 

aattatgttt gtttgcagag gc 22 



<210> 320 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 320 

gaacttctag tgtggctct 



19 
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<210> 321 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 321 

ccaagggagg caggagt 

<210> 322 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 322 

agtcaagcgt gggggtgg 



<210> 323 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 323 

ctcctcagca tcctgctgc 



<210> 324 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 324 

gaacaggagt ttccagtggc 

<210> 325 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Primer 



<400> 325 
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caccagtttt tggccctttg 



<210> 326 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 326 

ctgtcaccca cttgaagttc 



<210> 327 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 327 

ggtcagctgt ggtgctgg 



<210> 328 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 328 

aggccttggg agatgggat 



<210> 329 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 329 
tcctgccttc ttccag 



<210> 330 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 330 
actggacagt cctggc 



<210> 331 
<211> 16 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 331 
tgtggcaggg agccca 



<210> 332 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 332 

attatgtttg cttgcagagg 



<210> 333 
<211> 21 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence.: Primer 
<400> 333 

ggaacttcta atgtggctct g 



<210> 334 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 334 

cccaagggaa gcaggagtga 



<210> 335 
<211> 55 
<212> PRT 

<213> Homo sapiens 
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<400> 335 

Cys Cys Phe Ala His Asn Cys Ser Leu Arg Pro Gly Ala Gin Cys Ala 
15 10 15 

His Gly Asp Cys Cys Val Arg Cys Leu Leu Lys Pro Ala Gly Ala Leu 
20 "25 3 0 

Cys Arg Gin Ala Met Gly Asp Cys Asp Leu Pro Glu Phe Cys Thr Gly 
35 40 45 

Thr Ser Ser His Cys Pro Pro 
50 55 



<210> 336 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 336 

Thr Met Ala His Glu lie Gly His Ser Leu Gly 
15 10 



<210> 337 

<211> 86 

<212> PRT 

<213> Homo sapiens 

<400> 337 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg 
85 



<210> 338 
<211> 48 
<212> PRT 

<213> Homo sapiens 



<400> 338 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 
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Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 



<210> 339 
<211> 178 
<212> PRT 

<213> Homo sapiens 
<400> 339 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
1 5 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg Leu Leu Ala Pro Gly Tyr lie Glu Thr His 
85 90 95 

Tyr Gly Pro Asp Gly Gin Pro Val Val Leu Ala Pro Asn His Thr Val 
100 105 110 

Arg Cys Phe His Gly Leu Trp Asp Ala Pro Pro Glu Asp His Cys His 
115 120 125 

Tyr Gin Gly Arg Val Arg Gly Phe Pro Asp Ser Trp Val Val Leu Cys 
130 135 140 

Thr Cys Ser Gly Met Ser Gly Leu lie Thr Leu Ser Arg Asn Ala Ser 
145 150 155 160 

Tyr Tyr Leu Arg Pro Trp Pro Pro Arg Gly Ser Lys Asp Phe Ser Thr 
165 170 175 

His Glu 



<210> 340 
<211> 113 
<212> PRT 

<213> Homo sapiens 



<400> 340 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
1 ~ 5 10 15 
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Leu Leu Leu Leu Leu Leu Trp Pro 
20 

Gly His lie Pro Gly Gin Pro Val 
35 40 

Gin Pro Trp Arg Thr Val Ser Leu 
50 55 

Met Gly Leu Val Ala Leu Glu Ala 
65 70 

Leu Glu Lys Asn His Gly Leu lie 
85 

Tyr Leu Arg Pro Trp Pro Pro Arg 
100 

Glu 



Val Pro Gly Ala Gly Val Leu Gin 
25 30 

Thr Pro His Trp Val Leu Asp Gly 
45 

Glu Glu Pro Val Ser Lys Pro Asp 
60 

Glu Gly Gin Glu Leu Leu Leu Glu 
75 80 

Thr Leu Ser Arg Asn Ala Ser Tyr 
90 95 

Gly Ser Lys Asp Phe Ser Thr His 
105 110 



<210> 341 
<211> 165 
<212> PRT 
<213> Homo sapiens 

<400> 341 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 4 0 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg Leu Leu Ala Pro Gly Tyr lie Glu Thr His 
85 90 95 

Tyr Gly Pro Asp Gly Gin Pro Val Val Leu Ala Pro Asn His Thr Asp 
100 105 110 

His Cys His Tyr Gin Gly Arg Val Arg Gly Phe Pro Asp Ser Trp Val 
115 " 120 ~ " 125 

Val Leu Cys Thr Cys Ser Gly Met Ser Gly Leu lie Thr Leu Ser Arg 
130 135 140 



Asn Ala Ser Tyr Tyr Leu Arg Pro Trp Pro Pro Arg Gly Ser Lys Asp 
14 5 150 155 160 
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Phe Ser Thr His Glu 
165 



<210> 342 
<211> 168 
<212> PRT 

<213> Homo sapiens 
<400> 342 

Leu Ala Pro Gly Tyr lie Glu Thr His Tyr Gly Pro Asp Gly Gin Pro 
15 10 15 

Val Val Leu Ala Pro Asn His Thr Asp His Cys His Tyr Gin Gly Arg 
20 25 30 

Val Arg Gly Phe Pro Asp Ser Trp Val Val Leu Cys Thr Cys Ser Gly 
35 40 45 

Met Ser Gly Leu lie Thr Leu Ser Arg Asn Ala Ser Tyr Tyr Leu Arg 
50 55 60 

Pro Trp Pro Pro Arg Gly Ser Lys Asp Phe Ser Thr His Glu lie Phe 
65 70 75 80 

Arg Met Glu Gin Leu Leu Thr Trp Lys Gly Thr Cys Gly His Arg Asp 
85 90 95 

Pro Gly Asn Lys Ala Gly Met Thr Ser Leu Pro Gly Gly Pro Gin Ser 
100 105 110 

Arg Gly Arg Arg Lys Ala Arg Arg Thr Arg Lys Tyr Leu Glu Leu Tyr 
115 120 125 

lie Val Ala Asp His Thr Leu Phe Leu Thr Arg His Arg Asn Leu Asn 
130 135 140 

His Thr Lys Gin Arg Leu Leu Glu Val Ala Asn Tyr Val Asp Gin Leu 
145 150 155 160 



Leu Arg Thr Leu Asp lie Gin Val 
165 



<210> 343 
<211> 167 
<212> PRT 

<213> Homo sapiens 
<400> 343 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys 
15 10 15 

Gin Gin Leu Trp Gly Pro Gly Ser His Pro Ala Pro Glu Ala Cys Phe 
20 25 30 



Gin Val Val Asn Ser Ala Gly Asp Ala His Gly Asn Cys Gly Gin Asp 
35 40 45 
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Ser Glu Gly His Phe Leu Pro Cys 
50 55 

Lys Leu Gin Cys Gin Gly Gly Lys 
65 70 

Val Pro Val Asp Ser Thr Val His 
85 

Arg Gly Ala Leu Ala Leu Pro Ser 
100 



Ala Gly Arg Asp Ala Leu Cys Gly 
60 

Pro Ser Leu Leu Ala Pro His Met 
75 80 

Leu Asp Gly Gin Glu Val Thr Cys 
90 95 

Ala Gin Leu Asp Leu Leu Gly Leu 
105 110 



Gly Leu Val Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Asn 
115 120 125 

Ser Asn His Asn Cys His Cys Ala Pro Gly Trp Ala Pro Pro Phe Cys 
130 135 140 

Asp Lys Pro Gly Phe Gly Gly Ser Met Asp Ser Gly Pro Val Gin Ala 
145 150 ' 155 160 

Glu Asn His Asp Thr Phe Leu 
165 



<210> 344 
<211> 193 
<212> PRT 

<213> Homo sapiens 



<400> 344 

Ser Gly Tyr Cys Trp Asp Gly Ala 
1 5 

Gin Gin Leu Trp Gly Pro Gly Ser 
20 

Gin Val Val Asn Ser Ala Gly Asp 
35 40 

Ser Glu Gly His Phe Leu Pro Cys 
50 55 



Cys Pro Thr Leu Glu Gin Gin Cys 
10 15 

His Pro Ala Pro Glu Ala Cys Phe 
25 30 

Ala His Gly Asn Cys Gly Gin Asp 
45 

Ala Gly Arg Asp Ala Leu Cys Gly 
60 



Lys Leu Gin Cys Gin Gly Gly Lys 
65 70 

Val Pro Val Asp Ser Thr Val His 
85 

Arg Gly Ala Leu Ala Leu Pro Ser 
100 



Pro Ser Leu Leu Ala Pro His Met 
75 80 

Leu Asp Gly Gin Glu Val Thr Cys 
90 95 

Ala Gin Leu Asp Leu Leu Gly Leu 
105 110 



Gly Leu Val Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin 
115 120 125 

Ser Arg Arg Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu 
130 135 140 
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Thr Ala Cys His Ser His Gly Val 
145 150 

Cys Ala Pro Gly Trp Ala Pro Pro 
165 

Gly Ser Met Asp Ser Gly Pro Val 
180 

Leu 



Cys Asn Ser Asn His Asn Cys His 
155 160 

Phe Cys Asp Lys Pro Gly Phe Gly 
170 175 

Gin Ala Glu Asn His Asp Thr Phe 
185 190 



<210> 345 
<211> 126 
<212> PRT 
<213> Homo sapiens 

<400> 345 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys 
15 10 15 

Gin Gin Leu Trp Gly Pro Asp Gly Gin Glu Val Thr Cys Arg Gly Ala 
20 25 30 

Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu Gly Leu Val 
35 40 45 

Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin Ser Arg Arg 
50 55 60 

Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu Thr Ala Cys 
65 70 75 80 

His Ser His Gly Val Cys Asn Ser Asn His Asn Cys His Cys Ala Pro 
85 90 95 

Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly Gly Ser Met 
100 105 110 

Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe Leu 
115 120 125 



<210> 346 

<211> 93 

<212> PRT 

<213> Homo sapiens 

<400> 346 

Ala Trp Cys Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg Cys Ser 
15 10 15 

Trp Gly Cys Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp Gly Pro 
20 25 30 

His Arg Asp His Pro Leu Gly Gly Val His Pro Met Glu Leu Gly Pro 
35 ~ 40 45 
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Thr Ala Thr Gly Gin Pro Trp Pro 
50 55 

Pro Ser Ser His Pro Glu Lys Pro 
65 70 

Gin Ala Asp Gin Val Gin Met Pro 
85 



Leu Asp Pro Glu Asn Ser His Glu 
60 

Leu Pro Ala Val Ser Pro Asp Pro 
75 80 

Arg Ser Cys Leu Trp 
90 



<210> 347 

<211> 236 

<212> PRT i 

<213> Homo sapiens 



<400> 347 

Ser Gly Tyr Cys Trp Asp Gly Ala 
1 5 

Gin Gin Leu Trp Gly Pro Asp Gly 
20 

Leu Ala Leu Pro Ser Ala Gin Leu 
35 40 



Cys Pro Thr Leu Glu Gin Gin Cys 
10 15 

Gin Glu Val Thr Cys Arg Gly Ala 
25 30 

Asp Leu Leu Gly Leu Gly Leu Val 
45 



Glu Pro Gly Thr Gin Cys Gly Pro 
50 55 

Cys Arg Lys Asn Ala Phe Gin Glu 
65 70 

His Ser His Gly Val Cys Asn Ser 
85 

Gly Trp Ala Pro Pro Phe Cys Asp 
100 

Asp Ser Gly Pro Val Gin Ala Glu 
115 120 

Met Leu Leu Ser Val Leu Leu Pro 
130 135 



Arg Met Val Cys Gin Ser Arg Arg 
60 

Leu Gin Arg Cys Leu Thr Ala Cys 
75 80 

Asn His Asn Cys His Cys Ala Pro 
90 95 

Lys Pro Gly Phe Gly Gly Ser Met 
105 110 

Asn His Asp Thr Phe Leu Leu Ala 
125 

Leu Leu Pro Gly Ala Gly Leu Ala 
140 



Trp Cys Cys Tyr Arg Leu Pro Gly 
145 150 

Gly Cys Arg Arg Asp Pro Ala Cys 
165 

Arg Asp His Pro Leu Gly Gly Val 
180 



Ala His Leu Gin Arg Cys Ser Trp 
155 160 

Ser Gly Pro Lys Asp Gly Pro His 
170 175 

His Pro Met Glu Leu Gly Pro Thr 
185 190 



Ala Thr Gly Gin Pro Trp Pro Leu Asp Pro Glu Asn Ser His Glu Pro 
195 200 205 

Ser Ser His Pro Glu Lys Pro Leu Pro Ala Val Ser Pro Asp Pro Gin 
210 215 220 
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Ala Asp Gin Val Gin Met Pro Arg Ser Cys Leu Trp 
225 230 235 



<210> 348 
<211> 302 
<212> PRT 

<213> Homo sapiens 
<400> 348 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys 
1 5 10 15 

Gin Gin Leu Trp Gly Pro Gly Ser His Pro Ala Pro Glu Ala Cys Phe 
20 25 30 

Gin Val Val Asn Ser Ala Gly Asp Ala His Gly Asn Cys Gly Gin Asp 
35 40 45 

Ser Glu Gly His Phe Leu Pro Cys Ala Gly Arg Asp Ala Leu Cys Gly 
50 55 60 

Lys Leu Gin Cys Gin Gly Gly Lys Pro Ser Leu Leu Ala Pro His Met 
65 70 75 80 

Val Pro Val Asp Ser Thr Val His Leu Asp Gly Gin Glu Val Thr Cys 
85 90 95 

Arg Gly Ala Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu 
100 105 110 

Gly Leu Val Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin 
115 120 125 

Ser Arg Arg Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu 
130 135 140 

Thr Ala Cys His Ser His Gly Val Cys Asn Ser Asn His Asn Cys His 
145 150 155 160 

Cys Ala Pro Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly 
165 170 175 

Gly Ser Met Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe 
180 185 190 

Leu Leu Ala Met Leu Leu Ser Val Leu Leu Pro Leu Leu Pro Gly Ala 
195 200 205 

Gly Leu Ala Trp Cys Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg 
210 215 220 

Cys Ser Trp Gly Cys Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp 
225 230 235 240 

Gly Pro His Arg Asp His Pro Leu Gly Gly Val His Pro Met Glu Leu 
245 250 255 
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Gly Pro Thr Ala Thr Gly Gin Pro 
260 

His Glu Pro Ser Ser His Pro Glu 
275 280 

Asp Pro Gin Asp Gin Val Gin Met 

290 295 



Trp Pro Leu Asp Pro Glu Asn Ser 
265 270 

Lys Pro Leu Pro Ala Val Ser Pro 
285 

Pro Arg Ser Cys Leu Trp 
300 



<210> 349 
<211> 235 
<212> PRT 

<213> Homo sapiens 
<400> 349 

Ser Gly Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys 
15 10 15 

Gin Gin Leu Trp Gly Pro Asp Gly Gin Glu Val Thr Cys Arg Gly Ala 
20 25 "30 

Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu Gly Leu Val 
35 40 45 

Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin Ser Arg Arg 
50 55 60 

Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu Thr Ala Cys 
65 70 75 80 

His Ser His Gly Val Cys Asn Ser Asn His Asn Cys His Cys Ala Pro 
85 90 95 

Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly Gly Ser Met 
100 105 110 

Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe Leu Leu Ala 
115 120 125 

Met Leu Leu Ser Val Leu Leu Pro Leu Leu Pro Gly Ala Gly Leu Ala 
13 0 135 140 

Trp Cys Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg Cys Ser Trp 
145 150 155 160 

Gly Cys Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp Gly Pro His 
165 170 175 

Arg Asp His Pro Leu Gly Gly Val His Pro Met Glu Leu Gly Pro Thr 
180 185 190 

Ala Thr Gly Gin Pro Trp Pro Leu Asp Pro Glu Asn Ser His Glu Pro 
195 200 205 



Ser Ser His Pro Glu Lys Pro Leu Pro Ala Val Ser Pro Asp Pro Gin 
210 215 220 
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Asp Gin Val Gin Met Pro Arg Ser Cys Leu Trp 
225 230 235 



<210> 350 
<211> 339 
<212> DNA 

<213> Homo sapiens 
<400> 350 

cgggcacggg tcggccgcaa tccagcctgg gcggagccgg agttgcgagc . cgctgcctag 60 
aggccgagga gctcacagct atgggctgga ggccccggag agctcggggg accccgttgc 120 
tgctgctgct actactgctg ctgctctggc cagtgccagg cgccggggtg cttcaaggac 180 
atatccctgg gcagccagtc accccgcact gggtcctgga tggacaaccc tggcgcaccg 240 
tcagcctgga ggagccggtc tcgaagccag acatggggct ggtggccctg gaggctgaag 300 
gccaggagct cctgcttgag ctggagaaga accacaggc 339 

<210> 351 
<211> 225 
<212> DNA 

<213> Homo sapiens 
<400> 351 

cgggcacggg tcggccgcaa tccagcctgg gcggagccgg agttgcgagc cgctgcctag 60 
aggccgagga gctcacagct atgggctgga ggccccggag agctcggggg accccgttgc 12 0 
tgctgctgct actactgctg ctgctctggc cagtgccagg cgccggggtg cttcaaggac 180 
atatccctgg gcagccagtc accccgcact gggtcctgga tggac 225 



<210> 352 
<211> 562 
<212> DNA 
<213> Homo sapiens 

<400> 352 

gcctagaggc cgaggagctc acagctatgg gctggaggcc ccggagagct cgggggaccc 60 
cgttgctgct gctgctacta ctgctgctgc tctggccagt gccaggcgcc ggggtgcttc 120 
aaggacatat ccctgggcag ccagtcaccc cgcactgggt cctggatgga caaccctggc 180 
gcaccgtcag cctggaggag ccggtctcga agccagacat ggggctggtg gccctggagg 24 0 
ctgaaggcca ggagctcctg cttgagctgg agaagaacca caggctgctg gccccaggat 3 00 
acatagaaac ccactacggc ccagatgggc agccagtggt gctggccccc aaccacacgg 360 
tgagatgctt ccatgggctc tgggatgcac cgccagagga tcattgccac taccaagggc 42 0 
gagtaagggg cttccccgac tcctgggtag tcctctgcac ctgctctggg atgagtggcc 4 80 
tgatcaccct cagcaggaat gccagctatt atctgcgtcc ctggccaccc cggggctcca 54 0 
aggacttctc aacccacgag at 562 



<210> 353 
<211> 362 
<212> DNA 
<213> Homo sapiens 

<400> 353 

gaggccgagg agctcacagc tatgggctgg aggccccgga gagctcgggg gaccccgttg 6 0 
ctgctgctgc tactactgct gctgctctgg ccagtgccag gcgccggggt gcttcaagga 12 0 
catatccctg ggcagccagt caccccgcac tgggtcctgg atggacaacc ctggcgcacc 18 0 
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gtcagcctgg aggagccggt ctcgaagcca 
ggccaggagc tcctgcttga gctggagaag 
gccagctatt atctgcgtcc ctggccaccc 
at 



149/154 

gacatggggc tggtggccct ggaggctgaa 24 0 
aaccatggcc tgatcaccct cagcaggaat 30 0 
cggggctcca aggacttctc aacccacgag 360 

362 



<210> 354 
<211> 518 
<212> DNA 
<213> Homo sapiens 

<400> 354 

gaggccgagg agctcacagc tatgggctgg 
ctgctgctgc tactactgct gctgctctgg 
catatccctg ggcagccagt caccccgcac 
gtcagcctgg aggagccggt ctcgaagcca 
ggccaggagc tcctgcttga gctggagaag 
gaaacccact acggcccaga tgggcagcca 
tgccactacc aagggcgagt aaggggcttc 
tctgggatga gtggcctgat caccctcagc 
ccaccccggg gctccaagga cttctcaacc 



aggccccgga gagctcgggg gaccccgttg 60 
ccagtgccag gcgccggggt gcttcaagga 12 0 
tgggtcctgg atggacaacc ctggcgcacc 18 0 
gacatggggc tggtggccct ggaggctgaa 24 0 
aaccacaggc tgctggcccc aggatacata 30 0 
gtggtgctgg cccccaacca cacggatcat 360 
cccgactcct gggtagtcct ctgcacctgc 42 0 
aggaatgcca gctattatct gcgtccctgg 48 0 
cacgagat 518 



<210> 355 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<400> 355 

ctggccccag gatacataga aacccactac 
cccaaccaca cggatcattg ccactaccaa 
gtagtcctct gcacctgctc tgggatgagt 
tattatctgc gtccctggcc accccggggc 
cggatggagc agctgctcac ctggaaagga 
gcgggcatga ccagccttcc tggtggtccc 
acccggaagt acctggaact gtacattgtg 
cgaaacttga accacaccaa acagcgtctc 
ctcaggactc tggacattca ggtggc 



ggcccagatg ggcagccagt ggtgctggcc 60 
gggcgagtaa ggggcttccc cgactcctgg 12 0 
ggcctgatca ccctcagcag gaatgccagc 18 0 
tccaaggact tctcaaccca cgagatcttt 24 0 
acctgtggcc acagggatcc tgggaacaaa 300 
cagagcaggg gcaggcgaaa agcgcgcagg 360 
gcagaccaca ccctgttctt gactcggcac 42 0 
ctggaagtcg ccaactacgt ggaccagctt 480 

506 



<210> 356 
<211> 503 
<212> DNA 
<213> Homo sapiens 

<400> 356 

cagtggctac tgctgggatg gcgcatgtcc 
ggggcctggc tcccacccag ctcccgaggc 
tgctcatgga aactgcggcc aggacagcga 
tgccctgtgt gggaagctgc agtgccaggg 
ggtgccagtg gactctaccg ttcacctaga 
ggcactcccc agtgcccagc tggacctgct 
gtgtggacct agaatggttt gcaatagcaa 
tccacccttc tgtgacaagc caggctttgg 
tgaaaaccat gacaccttcc tgc 



cacgctggag cagcagtgcc agcagctctg 6 0 
ctgtttccag gtggtgaact ctgcgggaga 120 
gggccacttc ctgccctgtg cagggaggga 18 0 
tggaaagccc agcctgctcg caccgcacat 240 
tggccaggaa gtgacttgtc ggggagcctt 3 00 
tggcctgggc ctggtagagc caggcaccca 360 
ccataactgc cactgtgctc caggctgggc 42 0 
tggcagcatg gacagtggcc ctgtgcaggc 4 80 

5 03 



<210> 357 
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<211> 581 
<212> DNA 
<213> Homo sapiens 

<400> 357 

cagtggctac tgctgggatg gcgcatgtcc cacgctggag cagcagtgcc agcagctctg 60 
ggggcctggc tcccacccag ctcccgaggc ctgtttccag gtggtgaact ctgcgggaga 120 
tgctcatgga aactgcggcc aggacagcga gggccacttc ctgccctgtg cagggaggga 18 0 
tgccctgtgt gggaagctgc agtgccaggg tggaaagccc agcctgctcg caccgcacat 24 0 
ggtgccagtg gactctaccg ttcacctaga tggccaggaa gtgacttgtc ggggagcctt 3 00 
ggcactcccc agtgcccagc tggacctgct tggcctgggc ctggtagagc caggcaccca 360 
gtgtggacct agaatggtgt gccagagcag gcgctgcagg aagaatgcct tccaggagct 420 
tcagcgctgc ctgactgcct gccacagcca cggggtttgc aatagcaacc ataactgcca 480 
ctgtgctcca ggctgggctc cacccttctg tgacaagcca ggctttggtg gcagcatgga 540 
cagtggccct gtgcaggctg aaaaccatga caccttcctg c 581 



<210> 358 
<211> 380 
<212> DNA 
<213> Homo sapiens 

<400> 358 

cagtggctac tgctgggatg gcgcatgtcc 
ggggcctgat ggccaggaag tgacttgtcg 
ggacctgctt ggcctgggcc tggtagagcc 
ccagagcagg cgctgcagga agaatgcctt 
ccacagccac ggggtttgca atagcaacca 
acccttctgt gacaagccag gctttggtgg 
aaaccatgac accttcctgc 



cacgctggag cagcagtgcc agcagctctg 60 
gggagccttg gcactcccca gtgcccagct 120 
aggcacccag tgtggaccta gaatggtgtg 180 
ccaggagctt cagcgctgcc tgactgcctg 240 
taactgccac tgtgctccag gctgggctcc 300 
cagcatggac agtggccctg tgcaggctga 360 

• 3 80 



<210> 359 
<211> 324 
<212> DNA 
<213> Homo sapiens 

<400> 359 

ggcctggtgt tgctaccgac tcccaggagc 
aagggaccct gcgtgcagtg gccccaaaga 
cgttcacccc atggagttgg gccccacagc 
gaactctcat gagcccagca gccaccctga 
ccaagcagat caagtccaga tgccaagatc 
gaacagattt aaagacaggt ggcc 



<210> 360 
<211> 753 
<212> DNA 
<213> Homo sapiens 

<400> 360 

cagtggctac tgctgggatg gcgcatgtcc 
ggggcctgat ggccaggaag tgacttgtcg 
ggacctgctt ggcctgggcc tggtagagcc 
ccagagcagg cgctgcagga agaatgcctt 
ccacagccac ggggtttgca atagcaacca 
acccttctgt gacaagccag gctttggtgg 



ccatctgcag cgatgcagct ggggctgcag 60 
tggcccacac agggaccacc ccctgggcgg 120 
cactggacag ccctggcccc tggaccctga 180 
gaagcctctg ccagcagtct cgcctgaccc 240 
ctgcctctgg tgagaggtag ctcctaaaat 3 00 

324 



cacgctggag cagcagtgcc agcagctctg 60 
gggagccttg gcactcccca gtgcccagct 12 0 
aggcacccag tgtggaccta gaatggtgtg 18 0 
ccaggagctt cagcgctgcc tgactgcctg 24 0 
taactgccac tgtgctccag gctgggctcc 300 
cagcatggac agtggccctg tgcaggctga 360 
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aaaccatgac accttcctgc tggccatgct 
ggccggcctg gcctggtgtt gctaccgact 
gggctgcaga agggaccctg cgtgcagtgg 
cctgggcggc gttcacccca tggagttggg 
ggaccctgag aactctcatg agcccagcag 
gcctgacccc caagcagatc aagtccagat 
tcctaaaatg aacagattta aagacaggtg 



151/154 

cctcagcgtc ctgctgcctc tgctcccagg 420 
cccaggagcc catctgcagc gatgcagctg 4 80 
ccccaaagat ggcccacaca gggaccaccc 54 0 
ccccacagcc actggacagc cctggcccct 600 
ccaccctgag aagcctctgc cagcagtctc 660 
gccaagatcc tgcctctggt gagaggtagc 72 0 
gcc 753 



<210> 361 

<211> 1154 

<212> DMA 

<213> Homo sapiens 



<400> 361 

cagtggctac tgctgggatg gcgcatgtcc 
ggggcctggc tcccacccag ctcccgaggc 
tgctcatgga aactgcggcc aggacagcga 
tgccctgtgt gggaagctgc agtgccaggg 
ggtgccagtg gactctaccg ttcacctaga 
ggcactcccc agtgcccagc tggacctgct 
gtgtggacct agaatggtgt gccagagcag 
tcagcgctgc ctgactgcct gccacagcca 
ctgtgctcca ggctgggctc cacccttctg 
cagtggccct gtgcaggctg aaaaccatga 
cctgctgcct ctgctcccag gggccggcct 
ccatctgcag cgatgcagct ggggctgcag 
tggcccacac agggaccacc ccctgggcgg 
cactggacag ccctggcccc tggaccctga 
gaagcctctg ccagcagtct cgcctgaccc 
cctctggtga gaggtagctc ctaaaatgaa 
cactccagga acttgaactg caggggcaga 
caggcagctt ggaagtttct tccccgagtg 
gagccacatt agaagttcct gagggctgga 
taaaccatca gtcc 



cacgctggag cagcagtgcc agcagctctg 60 
ctgtttccag gtggtgaact ctgcgggaga 12 0 
gggccacttc ctgccctgtg cagggaggga 18 0 
tggaaagccc agcctgctcg caccgcacat 240 
tggccaggaa gtgacttgtc ggggagcctt 3 00 
tggcctgggc ctggtagagc caggcaccca 360 
gcgctgcagg aagaatgcct tccaggagct 420 
cggggtttgc aatagcaacc ataactgcca 480 
tgacaagcca ggctttggtg gcagcatgga 54 0 
caccttcctg ctggccatgc tcctcagcgt 600 
ggcctggtgt tgctaccgac tcccaggagc 660 
aagggaccct gcgtgcagtg gccccaaaga 720 
cgttcacccc atggagttgg gccccacagc 780 
gaactctcat gagcccagca gccaccctga 84 0 
ccaagatcaa gtccagatgc caagatcctg 900 
cagatttaaa gacaggtggc cactgacagc 960 
gccagtgaat caccggacct ccagcacctg 102 0 
gagcttcgac ccacccactc caggaaccca 1080 
gaacactgct gggcacactc tccagctcaa 1140 

1154 



<210> 362 
<211> 953 
<212> DNA 
<213> Homo sapiens 



<400> 362 

cagtggctac tgctgggatg gcgcatgtcc 
ggggcctgat ggccaggaag tgacttgtcg 
ggacctgctt ggcctgggcc tggtagagcc 
ccagagcagg cgctgcagga agaatgcctt 
ccacagccac ggggtttgca atagcaacca 
acccttctgt gacaagccag gctttggtgg 
aaaccatgac accttcctgc tggccatgct 
ggccggcctg gcctggtgtt gctaccgact 
gggctgcaga agggaccctg cgtgcagtgg 
cctgggcggc gttcacccca tggagttggg 
ggaccctgag aactctcatg agcccagcag 
gcctgacccc caagatcaag tccagatgcc 
taaaatgaac agatttaaag acaggtggcc 
aggggcagag ccagtgaatc accggacctc 
ccccgagtgg agcttcgacc cacccactcc 



cacgctggag cagcagtgcc agcagctctg 60 
gggagccttg gcactcccca gtgcccagct 120 
aggcacccag tgtggaccta gaatggtgtg 180 
ccaggagctt cagcgctgcc tgactgcctg 240 
taactgccac tgtgctccag gctgggctcc 300 
cagcatggac agtggccctg tgcaggctga 360 
cctcagcgtc ctgctgcctc tgctcccagg 420 
cccaggagcc catctgcagc gatgcagctg 480 
ccccaaagat ggcccacaca gggaccaccc 540 
ccccacagcc actggacagc cctggcccct 600 
ccaccctgag aagcctctgc cagcagtctc 660 
aagatcctgc ctctggtgag aggtagctcc 72 0 
actgacagcc actccaggaa cttgaactgc 780 
cagcacctgc aggcagcttg gaagtttctt 840 
aggaacccag agccacatta gaagttcctg 900 
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agggctggag aacactgctg ggcacactct ccagctcaat aaaccatcag tec 953 



<210> 363 
<211> 812 
<212> PRT 
<213> Homo sapiens 

<400> 363 

Met Gly Trp Arg Pro Arg Arg Ala Arg Gly Thr Pro Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Trp Pro Val Pro Gly Ala Gly Val Leu Gin 
20 25 30 

Gly His lie Pro Gly Gin Pro Val Thr Pro His Trp Val Leu Asp Gly 
35 40 45 

Gin Pro Trp Arg Thr Val Ser Leu Glu Glu Pro Val Ser Lys Pro Asp 
50 55 60 

Met Gly Leu Val Ala Leu Glu Ala Glu Gly Gin Glu Leu Leu Leu Glu 
65 70 75 80 

Leu Glu Lys Asn His Arg Leu Leu Ala Pro Gly Tyr He Glu Thr His 
85 90 95 

Tyr Gly Pro Asp Gly Gin Pro Val Val Leu Ala Pro Asn His Thr Asp 
100 105 110 

His Cys His Tyr Gin Gly Arg Val Arg Gly Phe Pro Asp Ser Trp Val 
115 120 125 

Val Leu Cys Thr Cys Ser Gly Met Ser Gly Leu He Thr Leu Ser Arg 
130 135 140 

Asn Ala Ser Tyr Tyr Leu Arg Pro Trp Pro Pro Arg Gly Ser Lys Asp 
145 150 155 160 

Phe Ser Thr His Glu He Phe Arg Met Glu Gin Leu Leu Thr Trp Lys 
165 170 175 

Gly Thr Cys Gly His Arg Asp Pro Gly Asn Lys Ala Gly Met Thr Ser 
180 185 190 

Leu Pro Gly Gly Pro Gin Ser Arg Gly Arg Arg Glu Ala Arg Arg Thr 
195 200 205 

Arg Lys Tyr Leu Glu Leu Tyr He Val Ala Asp His Thr Leu Phe Leu 
210 215 220 

Thr Arg His Arg Asn Leu Asn His Thr Lys Gin Arg Leu Leu Glu Val 
225 230 235 240 



Ala Asn Tyr Val Asp Gin Leu Leu Arg Thr Leu Asp He Gin Val Ala 
245 250 255 
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Leu Thr Gly Leu Glu Val Trp Thr Glu Arg Asp Arg Ser Arg Val Thr 
260 265 270 

Gin Asp Ala Asn Ala Thr Leu Trp Ala Phe Leu Gin Trp Arg Arg Gly 
275 280 285 

Leu Trp Ala Gin Arg Pro His Asp Ser Ala Gin Leu Leu Thr Gly Arg 
290 295 300 

Ala Phe Gin Gly Ala Thr Val Gly Leu Ala Pro Val Glu Gly Met Cys 
305 310 315 320 

Arg Ala Glu Ser Ser Gly Gly Val Ser Thr Asp His Ser Glu Leu Pro 
325 330 335 

lie Gly Ala Ala Ala Thr Met Ala His Glu He Gly His Ser Leu Gly 
340 345 350 

Leu Ser His Asp Pro Asp Gly Cys Cys Val Glu Ala Ala Ala Glu Ser 
355 360 365 

Gly Gly Cys Val Met Ala Ala Ala Thr Gly His Pro Phe Pro Arg Val 
370 375 380 

Phe Ser Ala Cys Ser Arg Arg Gin Leu Arg Ala Phe Phe Arg Lys Gly 
385 390 395 400 

Gly Gly Ala Cys Leu Ser Asn Ala Pro Asp Pro Gly Leu Pro Val" Pro 
405 410 415 

Pro Ala Leu Cys Gly Asn Gly Phe Val Glu Ala Gly Glu Glu Cys Asp 
420 425 430 

Cys Gly Pro Gly Gin Glu Cys Arg Asp Leu Cys Cys Phe Ala His Asn 
435 440 445 

Cys Ser Leu Arg Pro Gly Ala Gin Cys Ala His Gly Asp Cys Cys Val 
450 455 460 

Arg Cys Leu Leu Lys Pro Ala Gly Ala Leu Cys Arg Gin Ala Met Gly 
465 470 475 480 

Asp Cys Asp Leu Pro Glu Phe Cys Thr Gly Thr Ser Ser His Cys Pro 
485 490 495 

Pro Asp Val Tyr Leu Leu Asp Gly Ser Pro Cys Ala Arg Gly Ser Gly 
500 505 510 

Tyr Cys Trp Asp Gly Ala Cys Pro Thr Leu Glu Gin Gin Cys Gin Gin 
515 520 525 

Leu Trp Gly Pro Gly Ser His Pro Ala Pro Glu Ala Cys Phe Gin Val 
530 535 540 

Val Asn Ser Ala Gly Asp Ala His Gly Asn Cys Gly Gin Asp Ser Glu 
545 550 555 560 
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Gly His Phe Leu Pro Cys Ala Gly Arg Asp Ala Leu Cys Gly Lys Leu 
565 570 575 



Gin Cys Gin Gly Gly Lys Pro Ser Leu Leu Ala Pro His Met Val ' Pro 
580 585 590 

Val Asp Ser Thr Val His Leu Asp Gly Gin Glu Val Thr Cys Arg Gly 
595 600 605 

Ala Leu Ala Leu Pro Ser Ala Gin Leu Asp Leu Leu Gly Leu Gly Leu 
610 615 620 

Val Glu Pro Gly Thr Gin Cys Gly Pro Arg Met Val Cys Gin Ser Arg 
625 630 635 ~ 640 

Arg Cys Arg Lys Asn Ala Phe Gin Glu Leu Gin Arg Cys Leu Thr Ala 
645 650 655 

Cys His Ser His Gly Val Cys Asn Ser Asn His Asn Cys His Cys Ala 
660 665 670 



Pro Gly Trp Ala Pro Pro Phe Cys Asp Lys Pro Gly Phe Gly Gly Ser 

675 680 685 

Met Asp Ser Gly Pro Val Gin Ala Glu Asn His Asp Thr Phe Leu Leu 

690 695 700 



Ala Met Leu Leu Ser Val Leu Leu Pro Leu Leu Pro Gly Ala Gly Leu 
705 710 715 720 

Ala Trp cys Cys Tyr Arg Leu Pro Gly Ala His Leu Gin Arg Cys Ser 
725 730 735 



Trp Gly Cys Arg Arg Asp Pro Ala Cys Ser Gly Pro Lys Asp Gly Pro 
740 745 750 

His Arg Asp His Pro Leu Gly Gly Val His Pro Met Glu Leu Gly Pro 
755 760 765 



Thr Ala Thr Gly Gin Pro Trp Pro Leu Asp Pro Glu Asn Ser His Glu' 
770 775 780 

Pro Ser Ser His Pro Glu Lys Pro Leu Pro Ala Val Ser Pro Asp Pro 
785 790 795 800 

Gin Asp Gin Val Gin Met Pro Arg Ser Cys Leu Trp 
805 810 



